Compositions and methods for the therapy and diagnosis of ovarian cancer

ABSTRACT

Compositions and methods for the therapy and diagnosis of cancer, such as ovarian cancer, are disclosed. Compositions may comprise one or more ovarian tumor proteins, immunogenic portions thereof, or polynucleotides that encode such portions. Alternatively, a therapeutic composition may comprise an antigen presenting cell that expresses an ovarian tumor protein, or a T cell that is specific for cells expressing such a protein. Such compositions may be used, for example, for the prevention and treatment of diseases such as ovarian cancer. Diagnostic methods based on detecting an ovarian tumor protein, or mRNA encoding such a protein, in a sample are also provided.

CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application is related to U.S. Provisional Applications 60/211,457 filed Jun. 13, 2000; 60/207,107 filed May 24, 2000; 60/213,673 filed Jun. 21, 2000; 60/223,288 filed Aug. 3, 2000; and 60/272,790 filed Mar. 1, 2001, incorporated in their entirety herein by reference.

TECHNICAL FIELD OF THE INVENTION

[0002] The present invention relates generally to therapy and diagnosis of cancer, such as ovarian cancer. The invention is more specifically related to polypeptides comprising at least a portion of an ovarian tumor protein, and to polynucleotides encoding such polypeptides. Such polypeptides and polynucleotides may be used in vaccines and pharmaceutical compositions for prevention and treatment of ovarian cancer and for the diagnosis and monitoring of such cancers.

BACKGROUND OF THE INVENTION

[0003] Ovarian cancer is a significant health problem for women in the United States and throughout the world. Although advances have been made in detection and therapy of this cancer, no vaccine or other universally successful method for prevention or treatment is currently available. Current therapies, which are generally based on a combination of chemotherapy or surgery and radiation, continue to prove inadequate in many patients. The course of treatment for ovarian cancer is often selected based on a variety of prognostic parameters, including an analysis of specific tumor markers. However, the use of established markers often leads to a result that is difficult to interpret, and high mortality continues to be observed in many cancer patients.

[0004] Immunotherapies have the potential to substantially improve cancer treatment and survival. Such therapies may involve the generation or enhancement of an immune response to an ovarian carcinoma antigen. However, to date, relatively few ovarian carcinoma antigens are known and the generation of an immune response against such antigens has not been shown to be therapeutically beneficial.

[0005] In spite of considerable research into therapies for these and other cancers, ovarian cancer remains difficult to diagnose and treat effectively. Accordingly, there is a need in the art for improved methods for detecting and treating such cancers. The present invention fulfills these needs and further provides other related advantages.

SUMMARY OF THE INVENTION

[0006] Briefly stated, the present invention provides compositions and methods for the diagnosis and therapy of cancer, such as ovarian cancer. In one aspect, the present invention provides polypeptides comprising at least a portion of an ovarian tumor protein, or a variant thereof. Certain portions and other variants are immunogenic, such that the ability of the variant to react with antigen-specific antisera is not substantially diminished. Within certain embodiments, the polypeptide comprises a sequence that is encoded by a polynucleotide sequence selected from the group consisting of: (a) sequences recited in SEQ ID NO:1-168 and 173-339; (b) variants of a sequence recited in SEQ ID NO:1-168 and 173-339; and (c) complements of a sequence of (a) or (b). In specific embodiments, the polypeptides of the present invention comprise at least a portion of a tumor protein that includes an amino acid sequence selected from the group consisting of sequences encoded by SEQ ID NO:1-168 and 173-339 and variants thereof.

[0007] The present invention further provides polynucleotides that encode a polypeptide as described above, or a portion thereof (such as a portion encoding at least 15 amino acid residues of an ovarian tumor protein), expression vectors comprising such polynucleotides and host cells transformed or transfected with such expression vectors.

[0008] Within other aspects, the present invention provides pharmaceutical compositions comprising a polypeptide or polynucleotide as described above and a physiologically acceptable carrier.

[0009] Within a related aspect of the present invention, vaccines for prophylactic or therapeutic use are provided. Such vaccines comprise a polypeptide or polynucleotide as described above and an immunostimulant.

[0010] The present invention further provides pharmaceutical compositions that comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to an ovarian tumor protein; and (b) a physiologically acceptable carrier.

[0011] Within further aspects, the present invention provides pharmaceutical compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as described above and (b) a pharmaceutically acceptable carrier or excipient. Antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts and B cells.

[0012] Within related aspects, vaccines are provided that comprise: (a) an antigen presenting cell that expresses a polypeptide as described above and (b) an immunostimulant.

[0013] The present invention further provides, in other aspects, fusion proteins that comprise at least one polypeptide as described above, as well as polynucleotides encoding such fusion proteins.

[0014] Within related aspects, pharmaceutical compositions comprising a fusion protein, or a polynucleotide encoding a fusion protein, in combination with a physiologically acceptable carrier are provided.

[0015] Vaccines are further provided, within other aspects, that comprise a fusion protein, or a polynucleotide encoding a fusion protein, in combination with an immunostimulant.

[0016] Within further aspects, the present invention provides methods for inhibiting the development of a cancer in a patient, comprising administering to a patient a pharmaceutical composition or vaccine as recited above. The patient may be afflicted with ovarian cancer, in which case the methods provide treatment for the disease, or patient considered at risk for such a disease may be treated prophylactically.

[0017] The present invention further provides, within other aspects, methods for removing tumor cells from a biological sample, comprising contacting a biological sample with T cells that specifically react with an ovarian tumor protein, wherein the step of contacting is performed under conditions and for a time sufficient to permit the removal of cells expressing the protein from the sample.

[0018] Within related aspects, methods are provided for inhibiting the development of a cancer in a patient, comprising administering to a patient a biological sample treated as described above.

[0019] Methods are further provided, within other aspects, for stimulating and/or expanding T cells specific for an ovarian tumor protein, comprising contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that expresses such a polypeptide; under conditions and for a time sufficient to permit the stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells prepared as described above are also provided.

[0020] Within further aspects, the present invention provides methods for inhibiting the development of a cancer in a patient, comprising administering to a patient an effective amount of a T cell population as described above.

[0021] The present invention further provides methods for inhibiting the development of a cancer in a patient, comprising the steps of: (a) incubating CD4⁺ and/or CD8⁺ T cells isolated from a patient with one or more of: (i) a polypeptide comprising at least an immunogenic portion of an ovarian tumor protein; (ii) a polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that expressed such a polypeptide; and (b) administering to the patient an effective amount of the proliferated T cells, and thereby inhibiting the development of a cancer in the patient. Proliferated cells may, but need not, be cloned prior to administration to the patient.

[0022] Within further aspects, the present invention provides methods for determining the presence or absence of a cancer in a patient, comprising: (a) contacting a biological sample obtained from a patient with a binding agent that binds to a polypeptide as recited above; (b) detecting in the sample an amount of polypeptide that binds to the binding agent; and (c) comparing the amount of polypeptide with a predetermined cut-off value, and therefrom determining the presence or absence of a cancer in the patient. Within preferred embodiments, the binding agent is an antibody, more preferably a monoclonal antibody. The cancer may be ovarian cancer.

[0023] The present invention also provides, within other aspects, methods for monitoring the progression of a cancer in a patient. Such methods comprise the steps of: (a) contacting a biological sample obtained from a patient at a first point in time with a binding agent that binds to a polypeptide as recited above; (b) detecting in the sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) and (b) using a biological sample obtained from the patient at a subsequent point in time; and (d) comparing the amount of polypeptide detected in step (c) with the amount detected in step (b) and therefrom monitoring the progression of the cancer in the patient.

[0024] The present invention further provides, within other aspects, methods for determining the presence or absence of a cancer in a patient, comprising the steps of: (a) contacting a biological sample obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that encodes an ovarian tumor protein; (b) detecting in the sample a level of a polynucleotide, preferably mRNA, that hybridizes to the oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the oligonucleotide with a predetermined cut-off value, and therefrom determining the presence or absence of a cancer in the patient. Within certain embodiments, the amount of mRNA is detected via polymerase chain reaction using, for example, at least one oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as recited above, or a complement of such a polynucleotide. Within other embodiments, the amount of mRNA is detected using a hybridization technique, employing an oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as recited above, or a complement of such a polynucleotide.

[0025] In related aspects, methods are provided for monitoring the progression of a cancer in a patient, comprising the steps of: (a) contacting a biological sample obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that encodes an ovarian tumor protein; (b) detecting in the sample an amount of a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) using a biological sample obtained from the patient at a subsequent point in time; and (d) comparing the amount of polynucleotide detected in step (c) with the amount detected in step (b) and therefrom monitoring the progression of the cancer in the patient.

[0026] Within further aspects, the present invention provides antibodies, such as monoclonal antibodies, that bind to a polypeptide as described above, as well as diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more oligonucleotide probes or primers as described above are also provided.

[0027] These and other aspects of the present invention will become apparent upon reference to the following detailed description and attached drawings. All references disclosed herein are hereby incorporated by reference in their entirety as if each was incorporated individually.

BRIEF DESCRIPTION OF THE SEQUENCE IDENTIFIERS:

[0028] TABLE 1 Priority application SEQ ID NO: SEQ Ser. from Priority ID NO: Clone ID No. Application GenBank Homology 1. D52354 60/211,457 1 Novel 2. D54436 60/211,457 2 Novel 3. D54488 60/211,457 3 Novel 4. D58087 60/211,457 4 Novel 5. D58088 60/211,457 5 Novel 6. D58807 60/211,457 6 Novel 7. D58823 60/211,457 7 Novel 8. D58824 60/211,457 8 Novel 9. D52344 60/211,457 9 Novel 10. D52345 60/211,457 10 Novel 11. D54390 60/211,457 11 Novel 12. D36439 60/211,457 12 Novel 13. D38335 60/211,457 13 Novel 14. D50684 60/211,457 14 Novel 15. D50707 60/211,457 15 Novel 16. D38332 60/211,457 16 Novel 17. D38333 60/211,457 17 Novel 18. D38342 60/211,457 18 Novel 19. D38935 60/211,457 19 Novel 20. D38936 60/211,457 20 Novel 21. D38937 60/211,457 21 Novel 22. D38938 60/211,457 22 Novel 23. D38939 60/211,457 23 Novel 24. D38940 60/211,457 24 Novel 25. D38941 60/211,457 25 Novel 26. D38970 60/211,457 26 Novel 27. D38973 60/211,457 27 Novel 28. D38974 60/211,457 28 Novel 29. D41702 60/211,457 29 Novel 30. D41704 60/211,457 30 Novel 31. D19227 60/211,457 31 Novel 32. D22333 60/211,457 32 Novel 33. OVARIAN 60/211,457 33 Novel 34. OVARIAN 60/211,457 34 Novel 35. OVARIAN 60/211,457 35 Novel 36. OVARIAN 60/211,457 36 Novel 37. OVARIAN 60/211,457 37 Novel 38. OVARIAN 60/211,457 38 Novel 39. 32901.1 60/207,107 1 Human neutrophil-activating ENA- 78 prepeptide gene 40. 32900.1 60/207,107 2 collagenase 41. 32889.1 60/207,107 3 Human ribosomal complete repeating unit 28S 42. 32891.1 60/207,107 4 Fibronectin 43. 32892.1 60/207,107 5 KIAA022O 44. 32894.1 60/207,107 6 KIAA0117 45. 32896.1 60/207,107 7 Actin related protein 2/3 complex, subunit 1B (41kD) 46. 32897.1 60/207,107 8 Novel 47. 32898.1 60/207,107 9 Ribosomal protein, large, P1 (RPLP1) 48. 32899.1 60/207,107 10 TCP-1 (t-complex-1) ring complex 49. 30172 60/213,673 1 Novel 50. 32570 60/213,673 2 Novel 51. 32983 60/213,673 3 Novel 52. 43728 60/213,673 4 Novel 53. 67504 60/213,673 5 Novel 54. 31694 60/213,673 6 Novel 55. 32160 60/213,673 7 Novel 56. 32981 60/213,673 8 Novel 57. 32982 60/213,673 9 Novel 58. 33496 60/213,673 10 Novel 59. 33910 60/213,673 11 Novel 60. 33911 60/213,673 12 Novel 61. 33912 60/213,673 13 Novel 62. 18605.1 60/213,673 14 Novel 63. 18602.2 60/213,673 15 Novel 64. 18620.2 60/213,673 16 Novel 65. 19866.1 60/213,673 17 Novel 66. 16904 60/213,673 18 Novel 67. 35858 60/213,673 19 Novel 68. D36458 60/213,673 20 Novel 69. 36480 60/213,673 21 Novel 70. 38334 60/213,673 22 Novel 71. 38906 60/213,673 23 Novel 72. 38922 60/213,673 24 Novel 73. 38923 60/213,673 25 Novel 74. 38932 60/213,673 26 Novel 75. D41703 60/213,673 27 Novel 76. D49696 60/213,673 28 Novel 77. D49700 60/213,673 29 Novel 78. 49702 60/213,673 30 Novel 79. 49706 60/213,673 31 Novel 80. 52358 60/213,673 32 Novel 81. 58212 60/213,673 33 Novel 82. 59083 60/213,673 34 Novel 83. 38964 60/213,673 35 Novel 84. 38965 60/213,673 36 Novel 85. D38966 60/213,673 37 Novel 86. D38967 60/213,673 38 Novel 87. D38968 60/213,673 39 Novel 88. 56274.1 60/223,288 1 Transcription factor ETR103, differentiation 89. 56275.1 60/223,288 2 Ubiquitin carboxyl-terminal hydrolase 90. 56277.1 60/223,288 3 Ubiquitin carboxyl-terminal hydrolase 91. 56278.1 60/223,288 4 Poly(ADP-ribose) polymerase 92. 56279.1 60/223,288 5 Zinc-fingure protein (ZNF185) 93. 56280.1 60/223,288 6 Mucin 1 94. 56281.1 60/223,288 7 ALEX3, differential expression in carcinomas/normals 95. 56283.1 60/223,288 8 Ribosomal RNA, 28S 96. 56284.1 60/223,288 9 CDNA clone KIAA0493 (chrom1) 97. 56285.1 60/223,288 10 novel 98. 56287.1 60/223,288 11 CDNA clone KIAA1416 99. 56288.1 60/223,288 12 Protein tyrosine phosphatase 100. 56289.1 60/223,288 13 Keratin 8 101. 56290.1 60/223,288 14 Small zinc fingure-like protein (TIM10) 102. 56292.1 60/223,288 15 Complement subcomponent C1 103. 56293.1 60/223,288 16 Ubiquitin carboxyl-terminal hydrolase 104. 56295.1 60/223,288 17 Cytokeritin 8 105. 56297.1 60/223,288 18 proceruloplasmin 106. 56298.1 60/223,288 19 Cytochrome b-5 reductase 107. 56299.1 60/223,288 20 Interferone induced protein 56 108. 56300.1 60/223,288 21 U/K upregulated by 1,25- dihydroxyvitamin D3 109. 56301.1 60/223,288 22 Chrom 1q23, clone RP4-809F7 110. 56302.1 60/223,288 23 CDNA clone, PTD010 (pituitary tumor) 111. 56303.1 60/223,288 24 Ubiquitin carboxyl-terminal hydrolase 112. 56304.1 60/223,288 25 Mitofilin, IMMT 113. 56305.1 60/223,288 26 CDNA clone DKFZp43J1114 114. 56307.1 60/223,288 27 ceruoloplasmin 115. 56308.1 60/223,288 28 Sialoucin CD164 116. 56309.1 60/223,288 29 Adaptor protein p150, for P13K 117. 56310.1 60/223,288 30 CDNA clone FLJ20568 (primary epithelial cells) 118. 56311.1 60/223,288 31 Beta-COP homolog 119. 56312.1 60/223,288 32 Chrom. 11, MEMN1 region 120. 56313.1 60/223,288 33 CDNA clone KIAA0068 121. 56314.1 60/223,288 34 Cement gland protein (XAG-2) 122. 56315.1 60/223,288 35 novel 123. 56316.1 60/223,288 36 CDNA clone DKFZp564D116 124. 56317.1 60/223,288 37 Ubiquitin carboxyl-terminal hydrolase 125. 56318.1 60/223,288 38 RAD21 (double strand break repair) 126. 56319.1 60/223,288 39 Keratin intermediate filament precursor 127. 56321.1 60/223,288 40 Ubiquitin carboxyl-terminal hydrolase 128. 56322.1 60/223,288 41 Tetraspan NET-6; transmembrane; pituitary 129. 56323.1 60/223,288 42 Ubiquitin carboxyl-terminal hydrolase 130. 56324.1 60/223,288 43 DEK, a putative oncogene 131. 56325.1 60/223,288 44 novel 132. 56327.1 60/223,288 45 CDNA clone KIAA0159 133. 56328.1 60/223,288 46 Chrom. 11, MEMN1 region 134. 56329.1 60/223,288 47 novel 135. 56330.1 60/223,288 48 Chitinase 3-like 1/cartilage glycoprotein-39 136. 56332.1 60/223,288 49 ORF#1: progesterone receptor, unactive/ORF#2:zinc fingure transcription factor 137. 56333.1 60/223,288 50 CDNA clone CGI-17 138. 56334.1 60/223,288 51 Complement subcomponent C3 139. 56335.1 60/223,288 52 Cytokeritin 8 140. 56336.1 60/223,288 53 PRAME, preferentially expressed Ag of melanoma 141. 56337.1 60/223,288 54 Ki nuclear auto antigen 142. 56338.1 60/223,288 55 Ubiquitin carboxyl-terminal hydrolase 143. 56339.1 60/223,288 56 Keratin 19 144. 56340.1 60/223,288 57 c-myc binding protein, MM-1 145. 56341.1 60/223,288 58 Transcription factor ETR103, differentiation 146. 56342.1 60/223,288 59 restin 147. 56344.1 60/223,288 60 Keratin 19 148. 56345.1 60/223,288 61 Serine protease, secreted 149. 56346.1 60/223,288 62 novel 150. 56347.1 60/223,288 63 Chitinase 3-like 1/cartilage glycoprotein-39 151. 56349.1 60/223,288 64 152. 56351.1 60/223,288 65 novel 153. 56352.1 60/223,288 66 Chitinase 3-like 1/cartilage glycoprotein-39 154. 56353.1 60/223,288 67 HnRNP A2/B1 155. 56354.1 60/223,288 68 Steroid receptor co-activator (SRC- 1) 156. 56355.1 60/223,288 69 CDC10; cell division cycle 10 157. 56356.1 60/223,288 70 Synaptobrevin-like 1 (SYBL1) 158. 56357.1 60/223,288 71 Melanoma-associated Ag MG50 159. 56358. 60/223,288 72 CDNA clone DKFZP434N161 160. 156359.1 60/223,288 73 MM46/GABA-A-receptor- associated 161. 56361.1 60/223,288 74 Keratin intermediate filament precursor 162. 56362.1 60/223,288 75 5-aminoimidazole-4-carboxamide- ribonucleotide 163. 56363.1 60/223,288 76 Chrom 20 164. 56364.1 60/223,288 77 Ubiquitin carboxyl-terminal hydrolase 165. 56365.1 60/223,288 78 Lysyl hydroxylase isoform 2 (PLOD2) 166. 56366.1 60/223,288 79 novel 167. 56367.1 60/223,288 80 cDNA clone 23815 (brain) 168. 56368.1 60/223,288 81 cDNA clone KIAA0391 169. PCR primer 60/223,288 82 — 1 170. PCR primer 60/223,288 83 — 2R + 1 171. PCR primer 60/223,288 84 — 2R + 2 172. PCR primer 60/223,288 85 — 2R + 3 173. 65346 60/272,790 1 Hu.26S proteasome subunit5a non- ATPase,4; antisecretory factor-1 174. 65347 60/272,790 2 Hu.cleavage stimulation factor 77kD subuni 175. 65348 60/272,790 3 novel 176. 65349 60/272,790 4 94% Dog thyroid mRNA for C3VS 177. 65351 60/272,790 5 Hu. transcriptional coactivator ALY 178. 65352 60/272,790 6 Human (plasma) protein S 179. 65353 60/272,790 7 Hu. keratin 18 180. 65353 60/272,790 8 Hu. keratin 18 181. 65354 60/272,790 9 Hu.complement subcomponent binding pro. (C1QBP); P32 subunit of human pre- mRNA splicing factor SF2 182. 65356 60/272,790 10 84%Mu.partial Rabip4 FYVE-finger containing protein 183. 65357 60/272,790 11 Hu. glucocorticoid-induced leucine zipper 184. 67313 60/272,790 12 Hu. major vault protein;sim. to kinesin-like 4 185. 67705 60/272,790 13 Hu.chromosme-assoc.protein-E (CAP-E) 186. 65360 60/272,790 14 portion of Hu.clone RP11-430K20 187. 65362 60/272,790 15 Hu.proteasome 26S subunit p27,non- ATPase, 9 (PSMD9) 188. 65363 60/272,790 16 novel 189. 65364 60/272,790 17 Hu.cDNA FLJ10188 fis, clone HEMBA1004693 190. 65364 60/272,790 18 Hu.cDNA FLJ10188 fis, clone HEMBA1004693 191. 66251 60/272,790 19 Hu.eukaryotic translation initiation factor 4 gamma 192. 67315 60/272,790 20 Hu.ubiquitin specific protease 16, FLJ21451 fis 193. 67657 60/272,790 21 194. 65369 60/272,790 22 MUC1 195. 67316 60/272,790 23 Hu.Ste2O-related serine/threonine kinase 196. 65371 60/272,790 24 Hu. FLJ21480 fis, clone COL05034 197. 65373 60/272,790 25 galectin-3 198. 65376 60/272,790 26 Hu. PIST 199. 65377 60/272,790 27 Hu.hypothetical protein FLJ10955 200. 65379 60/272,790 28 Hu. clone 24627 mRNA sequence 201. 65379 60/272,790 29 Hu. clone 24627 mRNA sequence 202. 66253 60/272,790 30 Hu.alpha 1,2-mannosidase 203. 66255 60/272,790 31 Novel 204. 66255 60/272,790 32 novel 205. 66256 60/272,790 33 Hu.cDNA FLJ10824 fis,FLJ10683 fis;KIAA0592 protein 206. 66256 60/272,790 34 Hu.cDNA FLJ10824 fis,FLJ10683 fis;KIAA0592 protein 207. 66257 60/272,790 35 Hu.poly(A)-binding pro., cytoplasmic 4 (inducible form) 208. 67708 60/272,790 36 Hu.RNA-binding protein BRUNOL2 209. 67710 60/272,790 37 Hu. kinesin 2; kinesin light-chain protein 210. 67319 60/272,790 38 Hu.centromere protein F, mitosin,CENP-F kinetochore 211. 67711 60/272,790 39 Hu.Rho-assoc.coiled-coil protein kinase2(ROCK2) 212. 66266 60/272,790 40 Hu.eukaryotic translation initiation factor 4A, isoform 1 213. 66270 60/272,790 41 Human retinal pigment epithelium 214. 66271 60/272,790 42 Hu.peroxisome receptor 1 215. 66273 60/272,790 43 Hu.laminin, gamma 1 (formerly LAMB2) 216. 66277 60/272,790 44 Hu.TATA element modulatory factor 1 (TMF1) 217. 67712 60/272,790 45 Hu.nuclear protein, ataxia-telangiectasia locus (NPAT) 218. 66285 60/272,790 46 Hu. ADP-ribosyltransferase 219. 66286 60/272,790 47 Hu.putative secreted protein ZSIG13, serine protease 220. 66287 60/272,790 48 89% murine mSin3A (clone pVZmSin3A) 221. 66292 60/272,790 49 Hu.FLJ22272 fis;HepG2 3′ region MboI clone hmd2gO2m3 222. 66292 60/272,790 50 Hu.FLJ22272 fis;HepG2 3′ region MboI clone hmd2g02m3 223. 66294 60/272,790 51 Hu.MUK-binding inhibitory protein (MBIP) 224. 66295 60/272,790 52 Hu. RAD50-2 protein (RAD50) 225. 67400 60/272,790 53 Human mRNA for KIAA0150 gene 226. 67402 60/272,790 54 Hu.mitchndr.tRNA;cDNA:FLJ22981 fis,clone KAT11391 227. 67403 60/272,790 55 Homo sapiens ribosomal protein L8 228. 67405 60/272,790 56 Hu. transmembrane protein 4 (TMEM4) 229. 67408 60/272,790 57 Hu.annexin A2, lipocortin II 230. 67409 60/272,790 58 Hu.nuclear corepressor KAP-1,KRAB- associated 231. 67410 60/272,790 59 Hu. BAC clone RP11-404P12 232. 67412 60/272,790 60 Hu.ubiquitin fusion-degradation 1 protein 233. 67715 60/272,790 61 Mu.zuotin related factor2,Hu.M-phase phosphoprotein 234. 67417 60/272,790 62 Hu.cDNA FLJ11189 fis,FLJ12238 fis,FLJ12470 fis 235. 67421 60/272,790 63 Hu.cDNA FLJ20425 fis, clone KAT02707 236. 67422 60/272,790 64 Hu.cDNA FLJ21378 fis, cloneCOL03256;sorting nexin7 237. 67322 60/272,790 65 Hu.collagenase stimulatory factor (EMMPRIN),basigin 238. 67323 60/272,790 66 Hu.transforming acidic coiled-coil containing protein 3 239. 67326 60/272,790 67 Hu.quinone oxidoreductase homolog (PIG3) 240. 67331 60/272,790 68 Hu.clone 24636;91%Mu.uterine protein (LOC55978) 241. 67717 60/272,790 69 Hu.annexin A1, lipocortin 242. 67720 60/272,790 70 Hu.alpha 1,2-mannosidase 243. 67722 60/272,790 71 Hu. mRNA for KIAA1398 protein 244. 67723 60/272,790 72 novel 245. 67731 60/272,790 73 Hu.cDNA FLJ11173 fis;sim.toestrogen- responsive B box protein 246. 67731 60/272,790 74 Hu.cDNA FLJ11173 fis;sim.toestrogen- responsive B box protein 247. 68556 60/272,790 75 Hu.chromosome-associated protein-C (hCAP-C) 248. 68572 60/272,790 76 Human DNA sequence from clone LA16- 361A3 249. 68563 60/272,790 77 Hu.laminin recptr.1(67kD, ribosomal pro. SA)(LAMR1) 250. 68569 60/272,790 78 Hu.farnesyl diphosphate synthase;GAPDH region 251. 69263 60/272,790 79 Hu.SH3-containing protein EEN and chromatin assemblyfactor-I p150 subunit 252. 69268 60/272,790 80 Hu. villin 2 (ezrin) (VIL2) 253. 69274 60/272,790 81 Hu.clone A9A2BR11 (CAC)n/(GTG)n repeat-containing 254. 69275 60/272,790 82 Hu.clone 24793 ionotropic ATP receptor P2X5b 255. 69276 60/272,790 83 novel 256. 69280 60/272,790 84 Hu.aldo-keto reductase family 1, member B1 257. 69283 60/272,790 85 Hu. talin (TLN) 258. 69286 60/272,790 86 Human pancreatic mucin (tumor) 259. 69689 60/272,790 87 Hu.macrophage galactose-specific lectin (hMAC-2) 260. 69690 60/272,790 88 Human 28S ribosomal RNA gene 261. 69696 60/272,790 89 Hu.trans-Golgi network protein(TGN51) 262. 69703 60/272,790 90 Hu.KIAA0336 gene product 263. 69705 60/272,790 91 Hu.mRNA for KIAA1398 protein 264. 65346 60/272,790 92 265. 65347 60/272,790 93 266. 65349 60/272,790 94 267. 65351 60/272,790 95 268. 65352 60/272,790 96 269. 65353 60/272,790 97 270. 65354 60/272,790 98 271. 65356 60/272,790 99 272. 65357 60/272,790 100 273. 67313 60/272,790 101 274. 67705 60/272,790 102 275. 65362 60/272,790 103 276. 65364 60/272,790 104 277. 66251 60/272,790 105 278. 67313 60/272,790 106 279. 67657 60/272,790 107 280. 65369 60/272,790 108 281. 67316 60/272,790 109 282. 65371 60/272,790 110 283. 65373 60/272,790 111 284. 65376 60/272,790 112 285. 65377 60/272,790 113 286. 65379 60/272,790 114 287. 66253 60/272,790 115 288. 66256 60/272,790 116 289. 66257 60/272,790 117 290. 67708 60/272,790 118 291. 67710 60/272,790 119 292. 67319 60/272,790 120 293. 67711 60/272,790 121 294. 66266 60/272,790 122 295. 66270 60/272,790 123 296. 66271 60/272,790 124 297. 66273 60/272,790 125 298. 66277 60/272,790 126 299. 67712 60/272,790 127 300. 66285 60/272,790 128 301. 66286 60/272,790 129 302. 66287 60/272,790 130 303. 66292 60/272,790 131 304. 66294 60/272,790 132 305. 66295 60/272,790 133 306. 67400 60/272,790 134 307. 67402 60/272,790 135 308. 67403 60/272,790 136 309. 67405 60/272,790 137 310. 67408 60/272,790 138 311. 67409 60/272,790 139 312. 67412 60/272,790 140 313. 67715 60/272,790 141 314. 67417 60/272,790 142 315. 67421 60/272,790 143 316. 67422 60/272,790 144 317. 67322 60/272,790 145 318. 67323 60/272,790 146 319. 67326 60/272,790 147 320. 67331 60/272,790 148 321. 67717 60/272,790 149 322. 67720 60/272,790 150 323. 67722 60/272,790 151 324. 67731 60/272,790 152 325. 68556 60/272,790 153 326. 68572 60/272,790 154 327. 68563 60/272,790 155 328. 68569 60/272,790 156 329. 69268 60/272,790 157 330. 69274 60/272,790 158 331. 69275 60/272,790 159 332. 69280 60/272,790 160 333. 69283 60/272,790 161 334. 69286 60/272,790 162 335. 69696 60/272,790 163 336. 69703 60/272,790 164 337. 69705 60/272,790 165 338. 66262 Hu. Fatty-acid-Coenzyme A ligase, long chain 2 (FACL2) 339. 66269 Hu. Hypothetical protein FLJ20651 340. 66262 protein 341. 66269 protein

DETAILED DESCRIPTION OF THE INVENTION

[0029] As noted above, the present invention is generally directed to compositions and methods for using the compositions, for example in the therapy and diagnosis of cancer, such as ovarian cancer. Certain illustrative compositions described herein include ovarian tumor polypeptides, polynucleotides encoding such polypeptides, binding agents such as antibodies, antigen presenting cells (APCS) and/or immune system cells (e.g, T cells). An “ovarian tumor protein,” as the term is used herein, refers generally to a protein that is expressed in ovarian tumor cells at a level that is at least two fold, and preferably at least five fold, greater than the level of expression in a normal tissue, as determined using a representative assay provided herein. Certain ovarian tumor proteins are tumor proteins that react detectably (within an immunoassay, such as an ELISA or Western blot) with antisera of a patient afflicted with ovarian.

[0030] Therefore, in accordance with the above, and as described further below, the present invention provides illustrative polynucleotide compositions having sequences set forth in SEQ ID NO:1-168 and 173-339, illustrative polypeptide compositions having amino acid sequences encoded by the polynucleotide sequences set forth in SEQ ID NO:1-168 and 173-339, antibody compositions capable of binding such polypeptides, and numerous additional embodiments employing such compositions, for example in the detection, diagnosis and/or therapy of human ovarian cancer.

[0031] POLYNUCLEOTIDE COMPOSITIONS

[0032] As used herein, the terms “DNA segment” and “polynucleotide” refer to a DNA molecule that has been isolated free of total genomic DNA of a particular species. Therefore, a DNA segment encoding a polypeptide refers to a DNA segment that contains one or more coding sequences yet is substantially isolated away from, or purified free from, total genomic DNA of the species from which the DNA segment is obtained. Included within the terms “DNA segment” and “polynucleotide” are DNA segments and smaller fragments of such segments, and also recombinant vectors, including, for example, plasmids, cosmids, phagemids, phage, viruses, and the like.

[0033] As will be understood by those skilled in the art, the DNA segments of this invention can include genomic sequences, extra-genomic and plasmid-encoded sequences and smaller engineered gene segments that express, or may be adapted to express, proteins, polypeptides, peptides and the like. Such segments may be naturally isolated, or modified synthetically by the hand of man.

[0034] “Isolated,” as used herein, means that a polynucleotide is substantially away from other coding sequences, and that the DNA segment does not contain large portions of unrelated coding DNA, such as large chromosomal fragments or other functional genes or polypeptide coding regions. Of course, this refers to the DNA segment as originally isolated, and does not exclude genes or coding regions later added to the segment by the hand of man.

[0035] As will be recognized by the skilled artisan, polynucleotides may be single-stranded (coding or antisense) or double-stranded, and may be DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules include HnRNA molecules, which contain introns and correspond to a DNA molecule in a one-to-one manner, and mRNA molecules, which do not contain introns. Additional coding or non-coding sequences may, but need not, be present within a polynucleotide of the present invention, and a polynucleotide may, but need not, be linked to other molecules and/or support materials.

[0036] Polynucleotides may comprise a native sequence (i.e., an endogenous sequence that encodes an ovarian tumor protein or a portion thereof) or may comprise a variant, or a biological or antigenic functional equivalent of such a sequence. Polynucleotide variants may contain one or more substitutions, additions, deletions and/or insertions, as further described below, preferably such that the immunogenicity of the encoded polypeptide is not diminished, relative to a native tumor protein. The effect on the immunogenicity of the encoded polypeptide may generally be assessed as described herein. The term “variants” also encompasses homologous genes of xenogenic origin.

[0037] When comparing polynucleotide or polypeptide sequences, two sequences are said to be “identical” if the sequence of nucleotides or amino acids in the two sequences is the same when aligned for maximum correspondence, as described below. Comparisons between two sequences are typically performed by comparing the sequences over a comparison window to identify and compare local regions of sequence similarity. A “comparison window” as used herein, refers to a segment of at least about 20 contiguous positions, usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned.

[0038] Optimal alignment of sequences for comparison may be conducted using the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., Madison, Wis.), using default parameters. This program embodies several alignment schemes described in the following references: Dayhoff, M. O. (1978) A model of evolutionary change in proteins—Matrices for detecting distant relationships. In Dayhoff, M. O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology vol. 183, Academic Press, Inc., San Diego, Calif.; Higgins, D. G. and Sharp, P. M. (1989) CABIOS 5:151-153; Myers, E. W. and Muller W. (1988) CABIOS 4:11-17; Robinson, E. D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol. Biol. Evol. 4:406-425; Sneath, P. H. A. and Sokal, R. R. (1973) Numerical Taxonomy-the Principles and Practice of Numerical Taxonomy, Freeman Press, San Francisco, Calif.; Wilbur, W. J. and Lipman, D. J. (1983) Proc. Natl. Acad., Sci. USA 80:726-730.

[0039] Alternatively, optimal alignment of sequences for comparison may be conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wis.), or by inspection.

[0040] One preferred example of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (1977) NucL. Acids Res. 25:3389-3402 and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 2.0 can be used, for example with the parameters described herein, to determine percent sequence identity for the polynucleotides and polypeptides of the invention. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information. In one illustrative example, cumulative scores can be calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always>0) and N (penalty score for mismatching residues; always<0). For amino acid sequences, a scoring matrix can be used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=4 and a comparison of both strands.

[0041] Preferably, the “percentage of sequence identity” is determined by comparing two optimally aligned sequences over a window of comparison of at least 20 positions, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid bases or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the reference sequence (i.e., the window size) and multiplying the results by 100 to yield the percentage of sequence identity.

[0042] Therefore, the present invention encompasses polynucleotide and polypeptide sequences having substantial identity to the sequences disclosed herein, for example those comprising at least 50% sequence identity, preferably at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or higher, sequence identity compared to a polynucleotide or polypeptide sequence of this invention using the methods described herein, (e.g., BLAST analysis using standard parameters, as described below). One skilled in this art will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning and the like.

[0043] In additional embodiments, the present invention provides isolated polynucleotides and polypeptides comprising various lengths of contiguous stretches of sequence identical to or complementary to one or more of the sequences disclosed herein. For example, polynucleotides are provided by this invention that comprise at least about 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more contiguous nucleotides of one or more of the sequences disclosed herein as well as all intermediate lengths there between. It will be readily understood that “intermediate lengths”, in this context, means any length between the quoted values, such as 16, 17, 18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, etc.; 150, 151, 152, 153, etc.; including all integers through 200-500; 500-1,000, and the like.

[0044] The polynucleotides of the present invention, or fragments thereof, regardless of the length of the coding sequence itself, may be combined with other DNA sequences, such as promoters, polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding segments, and the like, such that their overall length may vary considerably. It is therefore contemplated that a nucleic acid fragment of almost any length may be employed, with the total length preferably being limited by the ease of preparation and use in the intended recombinant DNA protocol. For example, illustrative DNA segments with total lengths of about 10,000, about 5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 base pairs in length, and the like, (including all intermediate lengths) are contemplated to be useful in many implementations of this invention.

[0045] In other embodiments, the present invention is directed to polynucleotides that are capable of hybridizing under moderately stringent conditions to a polynucleotide sequence provided herein, or a fragment thereof, or a complementary sequence thereof. Hybridization techniques are well known in the art of molecular biology. For purposes of illustration, suitable moderately stringent conditions for testing the hybridization of a polynucleotide of this invention with other polynucleotides include prewashing in a solution of 5× SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); hybridizing at 50° C.-65° C., 5× SSC, overnight; followed by washing twice at 65° C. for 20 minutes with each of 2×, 0.5× and 0.2× SSC containing 0.1% SDS.

[0046] Moreover, it will be appreciated by those of ordinary skill in the art that, as a result of the degeneracy of the genetic code, there are many nucleotide sequences that encode a polypeptide as described herein. Some of these polynucleotides bear minimal homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides that vary due to differences in codon usage are specifically contemplated by the present invention. Further, alleles of the genes comprising the polynucleotide sequences provided herein are within the scope of the present invention. Alleles are endogenous genes that are altered as a result of one or more mutations, such as deletions, additions and/or substitutions of nucleotides. The resulting mRNA and protein may, but need not, have an altered structure or function. Alleles may be identified using standard techniques (such as hybridization, amplification and/or database sequence comparison).

[0047] PROBES AND PRIMERS

[0048] In other embodiments of the present invention, the polynucleotide sequences provided herein can be advantageously used as probes or primers for nucleic acid hybridization. As such, it is contemplated that nucleic acid segments that comprise a sequence region of at least about 15 nucleotide long contiguous sequence that has the same sequence as, or is complementary to, a 15 nucleotide long contiguous sequence disclosed herein will find particular utility. Longer contiguous identical or complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 (including all intermediate lengths) and even up to full length sequences will also be of use in certain embodiments.

[0049] The ability of such nucleic acid probes to specifically hybridize to a sequence of interest will enable them to be of use in detecting the presence of complementary sequences in a given sample. However, other uses are also envisioned, such as the use of the sequence information for the preparation of mutant species primers, or primers for use in preparing other genetic constructions.

[0050] Polynucleotide molecules having sequence regions consisting of contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides or so (including intermediate lengths as well), identical or complementary to a polynucleotide sequence disclosed herein, are particularly contemplated as hybridization probes for use in, e.g., Southern and Northern blotting. This would allow a gene product, or fragment thereof, to be analyzed, both in diverse cell types and also in various bacterial cells. The total size of fragment, as well as the size of the complementary stretch(es), will ultimately depend on the intended use or application of the particular nucleic acid segment. Smaller fragments will generally find use in hybridization embodiments, wherein the length of the contiguous complementary region may be varied, such as between about 15 and about 100 nucleotides, but larger contiguous complementarity stretches may be used, according to the length complementary sequences one wishes to detect.

[0051] The use of a hybridization probe of about 15-25 nucleotides in length allows the formation of a duplex molecule that is both stable and selective. Molecules having contiguous complementary sequences over stretches greater than 15 bases in length are generally preferred, though, in order to increase stability and selectivity of the hybrid, and thereby improve the quality and degree of specific hybrid molecules obtained. One will generally prefer to design nucleic acid molecules having gene-complementary stretches of 15 to 25 contiguous nucleotides, or even longer where desired.

[0052] Hybridization probes may be selected from any portion of any of the sequences disclosed herein. All that is required is to review the sequence set forth in SEQ ID NO:1-38, or to any continuous portion of the sequence, from about 15-25 nucleotides in length up to and including the full length sequence, that one wishes to utilize as a probe or primer. The choice of probe and primer sequences may be governed by various factors. For example, one may wish to employ primers from towards the termini of the total sequence.

[0053] Small polynucleotide segments or fragments may be readily prepared by, for example, directly synthesizing the fragment by chemical means, as is commonly practiced using an automated oligonucleotide synthesizer. Also, fragments may be obtained by application of nucleic acid reproduction technology, such as the PCR™ technology of U.S. Pat. No. 4,683,202 (incorporated herein by reference), by introducing selected sequences into recombinant vectors for recombinant production, and by other recombinant DNA techniques generally known to those of skill in the art of molecular biology.

[0054] The nucleotide sequences of the invention may be used for their ability to selectively form duplex molecules with complementary stretches of the entire gene or gene fragments of interest. Depending on the application envisioned, one will typically desire to employ varying conditions of hybridization to achieve varying degrees of selectivity of probe towards target sequence. For applications requiring high selectivity, one will typically desire to employ relatively stringent conditions to form the hybrids, e.g., one will select relatively low salt and/or high temperature conditions, such as provided by a salt concentration of from about 0.02 M to about 0.15 M salt at temperatures of from about 50° C. to about 70° C. Such selective conditions tolerate little, if any, mismatch between the probe and the template or target strand, and would be particularly suitable for isolating related sequences.

[0055] Of course, for some applications, for example, where one desires to prepare mutants employing a mutant primer strand hybridized to an underlying template, less stringent (reduced stringency) hybridization conditions will typically be needed in order to allow formation of the heteroduplex. In these circumstances, one may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M salt, at temperatures ranging from about 20° C. to about 55° C. Cross-hybridizing species can thereby be readily identified as positively hybridizing signals with respect to control hybridizations. In any case, it is generally appreciated that conditions can be rendered more stringent by the addition of increasing amounts of formamide, which serves to destabilize the hybrid duplex in the same manner as increased temperature. Thus, hybridization conditions can be readily manipulated, and thus will generally be a method of choice depending on the desired results.

[0056] Polynucleotide Identification And Characterization

[0057] Polynucleotides may be identified, prepared and/or manipulated using any of a variety of well established techniques. For example, a polynucleotide may be identified, as described in more detail below, by screening a microarray of cDNAs for tumor-associated expression (i. e., expression that is at least two fold greater in a tumor than in normal tissue, as determined using a representative assay provided herein). Such screens may be performed, for example, using a Synteni microarray (Palo Alto, Calif.) according to the manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl. Acad. Sci. USA 93:10614-10619, 1996 and Heller et al., Proc. Natl. Acad. Sci. USA 94:2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA prepared from cells expressing the proteins described herein, such as ovarian tumor cells. Such polynucleotides may be amplified via polymerase chain reaction (PCR). For this approach, sequence-specific primers may be designed based on the sequences provided herein, and may be purchased or synthesized.

[0058] An amplified portion of a polynucleotide of the present invention may be used to isolate a full length gene from a suitable library (e.g., an ovarian tumor cDNA library) using well known techniques. Within such techniques, a library (cDNA or genomic) is screened using one or more polynucleotide probes or primers suitable for amplification. Preferably, a library is size-selected to include larger molecules. Random primed libraries may also be preferred for identifying 5′ and upstream regions of genes. Genomic libraries are preferred for obtaining introns and extending 5′ sequences.

[0059] For hybridization techniques, a partial sequence may be labeled (e.g., by nick-translation or end-labeling with ³²p) using well known techniques. A bacterial or bacteriophage library is then generally screened by hybridizing filters containing denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe (see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, N.Y., 1989). Hybridizing colonies or plaques are selected and expanded, and the DNA is isolated for further analysis. cDNA clones may be analyzed to determine the amount of additional sequence by, for example, PCR using a primer from the partial sequence and a primer from the vector. Restriction maps and partial sequences may be generated to identify one or more overlapping clones. The complete sequence may then be determined using standard techniques, which may involve generating a series of deletion clones. The resulting overlapping sequences can then assembled into a single contiguous sequence. A full length cDNA molecule can be generated by ligating suitable fragments, using well known techniques.

[0060] Alternatively, there are numerous amplification techniques for obtaining a full length coding sequence from a partial cDNA sequence. Within such techniques, amplification is generally performed via PCR. Any of a variety of commercially available kits may be used to perform the amplification step. Primers may be designed using, for example, software well known in the art. Primers are preferably 22-30 nucleotides in length, have a GC content of at least 50% and anneal to the target sequence at temperatures of about 68° C. to 72° C. The amplified region may be sequenced as described above, and overlapping sequences assembled into a contiguous sequence.

[0061] One such amplification technique is inverse PCR (see Triglia et al., Nucl. Acids Res. 16:8186, 1988), which uses restriction enzymes to generate a fragment in the known region of the gene. The fragment is then circularized by intramolecular ligation and used as a template for PCR with divergent primers derived from the known region. Within an alternative approach, sequences adjacent to a partial sequence may be retrieved by amplification with a primer to a linker sequence and a primer specific to a known region. The amplified sequences are typically subjected to a second round of amplification with the same linker primer and a second primer specific to the known region. A variation on this procedure, which employs two primers that initiate extension in opposite directions from the known sequence, is described in WO 96/38591. Another such technique is known as “rapid amplification of cDNA ends” or RACE. This technique involves the use of an internal primer and an external primer, which hybridizes to a polyA region or vector sequence, to identify sequences that are 5′ and 3′ of a known sequence. Additional techniques include capture PCR (Lagerstrom et al., PCR Methods Applic. 1:111-19, 1991) and walking PCR (Parker et al., NucL Acids. Res. 19:3055-60, 1991). Other methods employing amplification may also be employed to obtain a full length cDNA sequence.

[0062] In certain instances, it is possible to obtain a full length cDNA sequence by analysis of sequences provided in an expressed sequence tag (EST) database, such as that available from GenBank. Searches for overlapping ESTs may generally be performed using well known programs (e.g., NCBI BLAST searches), and such ESTs may be used to generate a contiguous full length sequence. Full length DNA sequences may also be obtained by analysis of genomic fragments.

[0063] Polynucleotide Expression In Host Cells

[0064] In other embodiments of the invention, polynucleotide sequences or fragments thereof which encode polypeptides of the invention, or fusion proteins or functional equivalents thereof, may be used in recombinant DNA molecules to direct expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences that encode substantially the same or a functionally equivalent amino acid sequence may be produced and these sequences may be used to clone and express a given polypeptide.

[0065] As will be understood by those of skill in the art, it may be advantageous in some instances to produce polypeptide-encoding nucleotide sequences possessing non-naturally occurring codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can be selected to increase the rate of protein expression or to produce a recombinant RNA transcript having desirable properties, such as a half-life which is longer than that of a transcript generated from the naturally occurring sequence.

[0066] Moreover, the polynucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter polypeptide encoding sequences for a variety of reasons, including but not limited to, alterations which modify the cloning, processing, and/or expression of the gene product. For example, DNA shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the nucleotide sequences. In addition, site-directed mutagenesis may be used to insert new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, or introduce mutations, and so forth.

[0067] In another embodiment of the invention, natural, modified, or recombinant nucleic acid sequences may be ligated to a heterologous sequence to encode a fusion protein. For example, to screen peptide libraries for inhibitors of polypeptide activity, it may be useful to encode a chimeric protein that can be recognized by a commercially available antibody. A fusion protein may also be engineered to contain a cleavage site located between the polypeptide-encoding sequence and the heterologous protein sequence, so that the polypeptide may be cleaved and purified away from the heterologous moiety.

[0068] Sequences encoding a desired polypeptide may be synthesized, in whole or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. (1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) Nucl. Acids Res. Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. For example, peptide synthesis can be performed using various solid-phase techniques (Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be achieved, for example, using the ABI 431A Peptide Synthesizer (Perkin Elmer, Palo Alto, Calif.).

[0069] A newly synthesized peptide may be substantially purified by preparative high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures and Molecular Principles, W H Freeman and Co., New York, N.Y.) or other comparable techniques available in the art. The composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing (e.g., the Edman degradation procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, may be altered during direct synthesis and/or combined using chemical methods with sequences from other proteins, or any part thereof, to produce a variant polypeptide.

[0070] In order to express a desired polypeptide, the nucleotide sequences encoding the polypeptide, or functional equivalents, may be inserted into appropriate expression vector, i.e., a vector which contains the necessary elements for the transcription and translation of the inserted coding sequence. Methods which are well known to those skilled in the art may be used to construct expression vectors containing sequences encoding a polypeptide of interest and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. Such techniques are described in Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. N.Y.

[0071] A variety of expression vector/host systems may be utilized to contain and express polynucleotide sequences. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems.

[0072] The “control elements” or “regulatory sequences” present in an expression vector are those non-translated regions of the vector—enhancers, promoters, 5′ and 3′ untranslated regions—which interact with host cellular proteins to carry out transcription and translation. Such elements may vary in their strength and specificity. Depending on the vector system and host utilized, any number of suitable transcription and translation elements, including constitutive and inducible promoters, may be used. For example, when cloning in bacterial systems, inducible promoters such as the hybrid lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or PSPORT1 plasmid (Gibco BRL, Gaithersburg, Md.) and the like may be used. In mammalian cell systems, promoters from mammalian genes or from mammalian viruses are generally preferred. If it is necessary to generate a cell line that contains multiple copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be advantageously used with an appropriate selectable marker.

[0073] In bacterial systems, a number of expression vectors may be selected depending upon the use intended for the expressed polypeptide. For example, when large quantities are needed, for example for the induction of antibodies, vectors which direct high level expression of fusion proteins that are readily purified may be used. Such vectors include, but are not limited to, the multifinctional E. coli cloning and expression vectors such as BLUESCRIPT (Stratagene), in which the sequence encoding the polypeptide of interest may be ligated into the vector in frame with sequences for the amino-terminal Met and the subsequent 7 residues of .beta.-galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. M. Schuster (1989) J. Biol. Chem. 264:5503-5509); and the like. pGEX Vectors (Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose beads followed by elution in the presence of free glutathione. Proteins made in such systems may be designed to include heparin, thrombin, or factor XA protease cleavage sites so that the cloned polypeptide of interest can be released from the GST moiety at will.

[0074] In the yeast, Saccharomyces cerevisiae, a number of vectors containing constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods Enzymol. 153:516-544.

[0075] In cases where plant expression vectors are used, the expression of sequences encoding polypeptides may be driven by any of a number of promoters. For example, viral promoters such as the 35S and 19S promoters of CaMV may be used alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-311. Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. Such techniques are described in a number of generally available reviews (see, for example, Hobbs, S. or Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York, N.Y.; pp. 191-196).

[0076] An insect system may also be used to express a polypeptide of interest. For example, in one such system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a non-essential region of the virus, such as the polyhedrin gene, and placed under control of the polyhedrin promoter. Successfuil insertion of the polypeptide-encoding sequence will render the polyhedrin gene inactive and produce recombinant virus lacking coat protein. The recombinant viruses may then be used to infect, for example, S. frugiperda cells or Trichoplusia larvae in which the polypeptide of interest may be expressed (Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. 91 :3224-3227).

[0077] In mammalian host cells, a number of viral-based expression systems are generally available. For example, in cases where an adenovirus is used as an expression vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain a viable virus which is capable of expressing the polypeptide in infected host cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells.

[0078] Specific initiation signals may also be used to achieve more efficient translation of sequences encoding a polypeptide of interest. Such signals include the ATG initiation codon and adjacent sequences. In cases where sequences encoding the polypeptide, its initiation codon, and upstream sequences are inserted into the appropriate expression vector, no additional transcriptional or translational control signals may be needed. However, in cases where only coding sequence, or a portion thereof, is inserted, exogenous translational control signals including the ATG initiation codon should be provided. Furthermore, the initiation codon should be in the correct reading frame to ensure translation of the entire insert. Exogenous translational elements and initiation codons may be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers which are appropriate for the particular cell system which is used, such as those described in the literature (Scharf, D. et al. (1994) Results Probi. Cell Differ. 20:125-162).

[0079] In addition, a host cell strain may be chosen for its ability to modulate the expression of the inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the polypeptide include, but are not limited to, acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a “prepro” form of the protein may also be used to facilitate correct insertion, folding and/or function. Different host cells such as CHO, HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery and characteristic mechanisms for such post-translational activities, may be chosen to ensure the correct modification and processing of the foreign protein.

[0080] For long-term, high-yield production of recombinant proteins, stable expression is generally preferred. For example, cell lines which stably express a polynucleotide of interest may be transformed using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector. Following the introduction of the vector, cells may be allowed to grow for 1-2 days in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells which successfully express the introduced sequences. Resistant clones of stably transformed cells may be proliferated using tissue culture techniques appropriate to the cell type.

[0081] Any number of selection systems may be used to recover transformed cell lines. These include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler, M. et al. (1977) Cell 11:223-32) and adenine phosphoribosyltransferase (Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tk.sup.—or aprt.sup.—cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can be used as the basis for selection; for example, dhfr which confers resistance to methotrexate (Wigler, M. et al (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14); and als or pat, which confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). Additional selectable genes have been described, for example, trpB, which allows cells to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-51). Recently, the use of visible markers has gained popularity with such markers as anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its substrate luciferin, being widely used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131).

[0082] Although the presence/absence of marker gene expression suggests that the gene of interest is also present, its presence and expression may need to be confirmed. For example, if the sequence encoding a polypeptide is inserted within a marker gene sequence, recombinant cells containing sequences can be identified by the absence of marker gene function. Alternatively, a marker gene can be placed in tandem with a polypeptide-encoding sequence under the control of a single promoter. Expression of the marker gene in response to induction or selection usually indicates expression of the tandem gene as well.

[0083] Alternatively, host cells which contain and express a desired polynucleotide sequence may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations and protein bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for the detection and/or quantification of nucleic acid or protein.

[0084] A variety of protocols for detecting and measuring the expression of polynucleotide-encoded products, using either polyclonal or monoclonal antibodies specific for the product are known in the art. Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on a given polypeptide may be preferred for some applications, but a competitive binding assay may also be employed. These and other assays are described, among other places, in Hampton, R. et al. (1990; Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. E. et al. (1983; J. Exp. Med. 158:1211-1216).

[0085] A wide variety of labels and conjugation techniques are known by those skilled in the art and may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization or PCR probes for detecting sequences related to polynucleotides include oligolabeling, nick translation, end-labeling or PCR amplification using a labeled nucleotide. Alternatively, the sequences, or any portions thereof may be cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of commercially available kits. Suitable reporter molecules or labels, which may be used include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as substrates, cofactors, inhibitors, magnetic particles, and the like.

[0086] Host cells transformed with a polynucleotide sequence of interest may be cultured under conditions suitable for the expression and recovery of the protein from cell culture. The protein produced by a recombinant cell may be secreted or contained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing polynucleotides of the invention may be designed to contain signal sequences which direct secretion of the encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other recombinant constructions may be used to join sequences encoding a polypeptide of interest to nucleotide sequence encoding a polypeptide domain which will facilitate purification of soluble proteins. Such purification facilitating domains include, but are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals, protein A domains that allow purification on immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, Calif.) between the purification domain and the encoded polypeptide may be used to facilitate purification. One such expression vector provides for expression of a fusion protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as described in Porath, J. et al. (1992, Prot. Exp. Purif 3:263-281) while the enterokinase cleavage site provides a means for purifying the desired polypeptide from the fusion protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. et al. (1993; DNA Cell Biol. 12:441-453).

[0087] In addition to recombinant production methods, polypeptides of the invention, and fragments thereof, may be produced by direct peptide synthesis using solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 85:2149-2154). Protein synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, using Applied Biosystems 431A Peptide Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically synthesized separately and combined using chemical methods to produce the full length molecule.

[0088] Site-Specific Mutacenesis

[0089] Site-specific mutagenesis is a technique useful in the preparation of individual peptides, or biologically functional equivalent polypeptides, through specific mutagenesis of the underlying polynucleotides that encode them. The technique, well-known to those of skill in the art, further provides a ready ability to prepare and test sequence variants, for example, incorporating one or more of the foregoing considerations, by introducing one or more nucleotide sequence changes into the DNA. Site-specific mutagenesis allows the production of mutants through the use of specific oligonucleotide sequences which encode the DNA sequence of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. Mutations may be employed in a selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise change the properties of the polynucleotide itself, and/or alter the properties, activity, composition, stability, or primary sequence of the encoded polypeptide.

[0090] In certain embodiments of the present invention, the inventors contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or more properties of the encoded polypeptide, such as the antigenicity of a polypeptide vaccine. The techniques of site-specific mutagenesis are well-known in the art, and are widely used to create variants of both polypeptides and polynucleotides. For example, site-specific mutagenesis is often used to alter a specific portion of a DNA molecule. In such embodiments, a primer comprising typically about 14 to about 25 nucleotides or so in length is employed, with about 5 to about 10 residues on both sides of the junction of the sequence being altered.

[0091] As will be appreciated by those of skill in the art, site-specific mutagenesis techniques have often employed a phage vector that exists in both a single stranded and double stranded form. Typical vectors useful in site-directed mutagenesis include vectors such as the M13 phage. These phage are readily commercially-available and their use is generally well-known to those skilled in the art. Double-stranded plasmids are also routinely employed in site directed mutagenesis that eliminates the step of transferring the gene of interest from a plasmid to a phage.

[0092] In general, site-directed mutagenesis in accordance herewith is performed by first obtaining a single-stranded vector or melting apart of two strands of a double-stranded vector that includes within its sequence a DNA sequence that encodes the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically. This primer is then annealed with the single-stranded vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated sequence and the second strand bears the desired mutation. This heteroduplex vector is then used to transform appropriate cells, such as E. coli cells, and clones are selected which include recombinant vectors bearing the mutated sequence arrangement.

[0093] The preparation of sequence variants of the selected peptide-encoding DNA segments using site-directed mutagenesis provides a means of producing potentially useful species and is not meant to be limiting as there are other ways in which sequence variants of peptides and the DNA sequences encoding them may be obtained. For example, recombinant vectors encoding the desired peptide sequence may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence variants. Specific details regarding these methods and protocols are found in the teachings of Maloy et al., 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and Maniatis et aL, 1982, each incorporated herein by reference, for that purpose.

[0094] As used herein, the term “oligonucleotide directed mutagenesis procedure” refers to template-dependent processes and vector-mediated propagation which result in an increase in the concentration of a specific nucleic acid molecule relative to its initial concentration, or in an increase in the concentration of a detectable signal, such as amplification. As used herein, the term “oligonucleotide directed mutagenesis procedure” is intended to refer to a process that involves the template-dependent extension of a primer molecule. The term template dependent process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the sequence of the newly synthesized strand of nucleic acid is dictated by the well-known rules of complementary base pairing (see, for example, Watson, 1987). Typically, vector mediated methodologies involve the introduction of the nucleic acid fragment into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of the amplified nucleic acid fragment. Examples of such methodologies are provided by U.S. Pat. No. 4,237,224, specifically incorporated herein by reference in its entirety.

[0095] Polynucleotide Amplification Techniques

[0096] A number of template dependent processes are available to amplify the target sequences of interest present in a sample. One of the best known amplification methods is the polymerase chain reaction (PCR™) which is described in detail in U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by reference in its entirety. Briefly, in PCR™, two primer sequences are prepared which are complementary to regions on opposite complementary strands of the target sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture along with a DNA polymerase (e.g., Taq polymerase). If the target sequence is present in a sample, the primers will bind to the target and the polymerase will cause the primers to be extended along the target sequence by adding on nucleotides. By raising and lowering the temperature of the reaction mixture, the extended primers will dissociate from the target to form reaction products, excess primers will bind to the target and to the reaction product and the process is repeated. Preferably reverse transcription and PCR™ amplification procedure may be performed in order to quantify the amount of mRNA amplified. Polymerase chain reaction methodologies are well known in the art.

[0097] Another method for amplification is the ligase chain reaction (referred to as LCR), disclosed in Eur. Pat. Appl. Publ. No. 320,308 (specifically incorporated herein by reference in its entirety). In LCR, two complementary probe pairs are prepared, and in the presence of the target sequence, each pair will bind to opposite complementary strands of the target such that they abut. In the presence of a ligase, the two probe pairs will link to form a single unit. By temperature cycling, as in PCR™, bound ligated units dissociate from the target and then serve as “target sequences” for ligation of excess probe pairs. U.S. Pat. No. 4,883,750, incorporated herein by reference in its entirety, describes an alternative method of amplification similar to LCR for binding probe pairs to a target sequence.

[0098] Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No. PCT/US87/00880, incorporated herein by reference in its entirety, may also be used as still another amplification method in the present invention. In this method, a replicative sequence of RNA that has a region complementary to that of a target is added to a sample in the presence of an RNA polymerase. The polymerase will copy the replicative sequence that can then be detected.

[0099] An isothermal amplification method, in which restriction endonucleases and ligases are used to achieve the amplification of target molecules that contain nucleotide 5′-[α-thio]triphosphates in one strand of a restriction site (Walker et al., 1992, incorporated herein by reference in its entirety), may also be useful in the amplification of nucleic acids in the present invention.

[0100] Strand Displacement Amplification (SDA) is another method of carrying out isothermal amplification of nucleic acids which involves multiple rounds of strand displacement and synthesis, i.e. nick translation. A similar method, called Repair Chain Reaction (RCR) is another method of amplification which may be useful in the present invention and is involves annealing several probes throughout a region targeted for amplification, followed by a repair reaction in which only two of the four bases are present. The other two bases can be added as biotinylated derivatives for easy detection. A similar approach is used in SDA.

[0101] Sequences can also be detected using a cyclic probe reaction (CPR). In CPR, a probe having a 3′ and 5′ sequences of non-target DNA and an internal or “middle” sequence of the target protein specific RNA is hybridized to DNA which is present in a sample. Upon hybridization, the reaction is treated with RNaseH, and the products of the probe are identified as distinctive products by generating a signal that is released after digestion. The original template is annealed to another cycling probe and the reaction is repeated. Thus, CPR involves amplifying a signal generated by hybridization of a probe to a target gene specific expressed nucleic acid.

[0102] Still other amplification methods described in Great Britain Pat. Appl. No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025, each of which is incorporated herein by reference in its entirety, may be used in accordance with the present invention. In the former application, “modified” primers are used in a PCR-like, template and enzyme dependent synthesis. The primers may be modified by labeling with a capture moiety (e.g., biotin) and/or a detector moiety (e.g., enzyme). In the latter application, an excess of labeled probes is added to a sample. In the presence of the target sequence, the probe binds and is cleaved catalytically. After cleavage, the target sequence is released intact to be bound by excess probe. Cleavage of the labeled probe signals the presence of the target sequence.

[0103] Other nucleic acid amplification procedures include transcription-based amplification systems (TAS) (Kwoh et al., 1989; PCT Intl. Pat. Appl. Publ. No. WO 88/10315, incorporated herein by reference in its entirety), including nucleic acid sequence based amplification (NASBA) and 3SR. In NASBA, the nucleic acids can be prepared for amplification by standard phenol/chloroform extraction, heat denaturation of a sample, treatment with lysis buffer and minispin columns for isolation of DNA and RNA or guanidinium chloride extraction of RNA. These amplification techniques involve annealing a primer that has sequences specific to the target sequence. Following polymerization, DNA/RNA hybrids are digested with RNase H while double stranded DNA molecules are heat-denatured again. In either case the single stranded DNA is made fully double stranded by addition of second target-specific primer, followed by polymerization. The double stranded DNA molecules are then multiply transcribed by a polymerase such as T7 or SP6. In an isothermal cyclic reaction, the RNAs are reverse transcribed into DNA, and transcribed once again with a polymerase such as T7 or SP6. The resulting products, whether truncated or complete, indicate target-specific sequences.

[0104] Eur. Pat. Appl. Publ. No. 329,822, incorporated herein by reference in its entirety, disclose a nucleic acid amplification process involving cyclically synthesizing single-stranded RNA (“ssRNA”), ssDNA, and double-stranded DNA (dsDNA), which may be used in accordance with the present invention. The ssRNA is a first template for a first primer oligonucleotide, which is elongated by reverse transcriptase (RNA-dependent DNA polymerase). The RNA is then removed from resulting DNA:RNA duplex by the action of ribonuclease H (RNase H, an RNase specific for RNA in a duplex with either DNA or RNA). The resultant ssDNA is a second template for a second primer, which also includes the sequences of an RNA polymerase promoter (exemplified by T7 RNA polymerase) 5′ to its homology to its template. This primer is then extended by DNA polymerase (exemplified by the large “Klenow” fragment of E. coli DNA polymerase I), resulting as a double-stranded DNA (“dsDNA”) molecule, having a sequence identical to that of the original RNA between the primers and having additionally, at one end, a promoter sequence. This promoter sequence can be used by the appropriate RNA polymerase to make many RNA copies of the DNA. These copies can then re-enter the cycle leading to very swift amplification. With proper choice of enzymes, this amplification can be done isothermally without addition of enzymes at each cycle. Because of the cyclical nature of this process, the starting sequence can be chosen to be in the form of either DNA or RNA.

[0105] PCT Intl. Pat. Appl. Publ. No. WO 89/06700, incorporated herein by reference in its entirety, disclose a nucleic acid sequence amplification scheme based on the hybridization of a promoter/primer sequence to a target single-stranded DNA (“ssDNA”) followed by transcription of many RNA copies of the sequence. This scheme is not cyclic; i. e. new templates are not produced from the resultant RNA transcripts. Other amplification methods include “RACE” (Frohman, 1990), and “one-sided PCR” (Ohara, 1989) which are well-known to those of skill in the art.

[0106] Methods based on ligation of two (or more) oligonucleotides in the presence of nucleic acid having the sequence of the resulting “di-oligonucleotide”, thereby amplifying the di-oligonucleotide (Wu and Dean, 1996, incorporated herein by reference in its entirety), may also be used in the amplification of DNA sequences of the present invention.

[0107] Biological Functional Equivalents

[0108] Modification and changes may be made in the structure of the polynucleotides and polypeptides of the present invention and still obtain a functional molecule that encodes a polypeptide with desirable characteristics. As mentioned above, it is often desirable to introduce one or more mutations into a specific polynucleotide sequence. In certain circumstances, the resulting encoded polypeptide sequence is altered by this mutation, or in other cases, the sequence of the polypeptide is unchanged by one or more mutations in the encoding polynucleotide.

[0109] When it is desirable to alter the amino acid sequence of a polypeptide to create an equivalent, or even an improved, second-generation molecule, the amino acid changes may be achieved by changing one or more of the codons of the encoding DNA sequence, according to Table 2.

[0110] For example, certain amino acids may be substituted for other amino acids in a protein structure without appreciable loss of interactive binding capacity with structures such as, for example, antigen-binding regions of antibodies or binding sites on substrate molecules. Since it is the interactive capacity and nature of a protein that defines that protein's biological functional activity, certain amino acid sequence substitutions can be made in a protein sequence, and, of course, its underlying DNA coding sequence, and nevertheless obtain a protein with like properties. It is thus contemplated by the inventors that various changes may be made in the peptide sequences of the disclosed compositions, or corresponding DNA sequences which encode said peptides without appreciable loss of their biological utility or activity. TABLE 2 Amino Acids Codons Alanine Ala A GCA GCC GCG GCU Cysteine Cys C UGC UGU Aspartic acid Asp D GAC GAU Glutamic acid Glu E GAA GAG Phenylalanine Phe F UUC UUU Glycine Gly G GGA GGC GGG GGU Histidine His H CAC CAU Isoleucine Ile I AUA AUC AUU Lysine Lys K AAA AAG Leucine Leu L UUA UUG CUA CUC CUG CUU Methionine Met M AUG Asparagine Asn N AAC AAU Proline Pro P CCA CCC CCG CCU Glutamine Gln Q CAA CAG Arginine Arg R AGA AGG CGA CGC CGG CGU Serine Ser S AGC AGU UCA UCC UCG UCU Threonine Thr T ACA ACC ACG ACU Valine Val V GUA GUC GUG GUU Tryptophan Trp W UGG Tyrosine Tyr Y UAC UAU

[0111] In making such changes, the hydropathic index of amino acids may be considered. The importance of the hydropathic amino acid index in conferring interactive 5 biologic function on a protein is generally understood in the art (Kyte and Doolittle, 1982, incorporated herein by reference). It is accepted that the relative hydropathic character of the amino acid contributes to the secondary structure of the resultant protein, which in turn defines the interaction of the protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and the like. Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (−0.4); threonine (−0.7); serine (−0.8); tryptophan (−0.9); tyrosine (−1.3); proline (−1.6); histidine (−3.2); glutamate (−3.5); glutamine (−3.5); aspartate (−3.5); asparagine (−3.5); lysine (−3.9); and arginine (−4.5).

[0112] It is known in the art that certain amino acids may be substituted by other amino acids having a similar hydropathic index or score and still result in a protein with similar biological activity, i.e. still obtain a biological functionally equivalent protein. In making such changes, the substitution of amino acids whose hydropathic indices are within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred. It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity. U.S. Pat. No. 4,554,101 (specifically incorporated herein by reference in its entirety), states that the greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein.

[0113] As detailed in U.S. Pat. No. 4,554,101, the following hydrophilicity values have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0±1); glutamate (+3.0±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (−0.4); proline (−0.5±1); alanine (−0.5); histidine (−0.5); cysteine (−1.0); methionine (−1.3); valine (−1.5); leucine (−1.8); isoleucine (−1.8); tyrosine (−2.3); phenylalanine (−2.5); tryptophan (−3.4). It is understood that an amino acid can be substituted for another having a similar hydrophilicity value and still obtain a biologically equivalent, and in particular, an immunologically equivalent protein. In such changes, the substitution of amino acids whose hydrophilicity values are within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred.

[0114] As outlined above, amino acid substitutions are generally therefore based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that take various of the foregoing characteristics into consideration are well known to those of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and asparagine; and valine, leucine and isoleucine.

[0115] In addition, any polynucleotide may be further modified to increase stability in vivo. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5′ and/or 3′ ends; the use of phosphorothioate or 2′ O-methyl rather than phosphodiesterase linkages in the backbone; and/or the inclusion of nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and uridine.

[0116] In Vivo Polynucleotide Delivery Techniques

[0117] In additional embodiments, genetic constructs comprising one or more of the polynucleotides of the invention are introduced into cells in vivo. This may be achieved using any of a variety or well known approaches, several of which are outlined below for the purpose of illustration.

[0118] 1. Adenovirus

[0119] One of the preferred methods for in vivo delivery of one or more nucleic acid sequences involves the use of an adenovirus expression vector. “Adenovirus expression vector” is meant to include those constructs containing adenovirus sequences sufficient to (a) support packaging of the construct and (b) to express a polynucleotide that has been cloned therein in a sense or antisense orientation. Of course, in the context of an antisense construct, expression does not require that the gene product be synthesized.

[0120] The expression vector comprises a genetically engineered form of an adenovirus. Knowledge of the genetic organization of adenovirus, a 36 kb, linear, double-stranded DNA virus, allows substitution of large pieces of adenoviral DNA with foreign sequences up to 7 kb (Grunhaus and Horwitz, 1992). In contrast to retrovirus, the adenoviral infection of host cells does not result in chromosomal integration because adenoviral DNA can replicate in an episomal manner without potential genotoxicity. Also, adenoviruses are structurally stable, and no genome rearrangement has been detected after extensive amplification. Adenovirus can infect virtually all epithelial cells regardless of their cell cycle stage. So far, adenoviral infection appears to be linked only to mild disease such as acute respiratory disease in humans.

[0121] Adenovirus is particularly suitable for use as a gene transfer vector because of its mid-sized genome, ease of manipulation, high titer, wide target-cell range and high infectivity. Both ends of the viral genome contain 100-200 base pair inverted repeats (ITRs), which are cis elements necessary for viral DNA replication and packaging. The early (E) and late (L) regions of the genome contain different transcription units that are divided by the onset of viral DNA replication. The El region (EIA and E1B) encodes proteins responsible for the regulation of transcription of the viral genome and a few cellular genes. The expression of the E2 region (E2A and E2B) results in the synthesis of the proteins for viral DNA replication. These proteins are involved in DNA replication, late gene expression and host cell shut-off (Renan, 1990). The products of the late genes, including the majority of the viral capsid proteins, are expressed only after significant processing of a single primary transcript issued by the major late promoter (MLP). The MLP, (located at 16.8 m.u.) is particularly efficient during the late phase of infection, and all the mRNA's issued from this promoter possess a 5′-tripartite leader (TPL) sequence which makes them preferred mRNA's for translation.

[0122] In a current system, recombinant adenovirus is generated from homologous recombination between shuttle vector and provirus vector. Due to the possible recombination between two proviral vectors, wild-type adenovirus may be generated from this process. Therefore, it is critical to isolate a single clone of virus from an individual plaque and examine its genomic structure.

[0123] Generation and propagation of the current adenovirus vectors, which are replication deficient, depend on a unique helper cell line, designated 293, which was transformed from human embryonic kidney cells by Ad5 DNA fragments and constitutively expresses El proteins (Graham et al., 1977). Since the E3 region is dispensable from the adenovirus genome (Jones and Shenk, 1978), the current adenovirus vectors, with the help of 293 cells, carry foreign DNA in either the El, the D3 or both regions (Graham and Prevec, 1991). In nature, adenovirus can package approximately 105% of the wild-type genome (Ghosh-Choudhury et al., 1987), providing capacity for about 2 extra kB of DNA. Combined with the approximately 5.5 kB of DNA that is replaceable in the El and E3 regions, the maximum capacity of the current adenovirus vector is under 7.5 kB, or about 15% of the total length of the vector. More than 80% of the adenovirus viral genome remains in the vector backbone and is the source of vector-borne cytotoxicity. Also, the replication deficiency of the El-deleted virus is incomplete. For example, leakage of viral gene expression has been observed with the currently available vectors at high multiplicities of infection (MOI) (Mulligan, 1993).

[0124] Helper cell lines may be derived from human cells such as human embryonic kidney cells, muscle cells, hematopoietic cells or other human embryonic mesenchymal or epithelial cells. Alternatively, the helper cells may be derived from the cells of other mammalian species that are permissive for human adenovirus. Such cells include, e.g., Vero cells or other monkey embryonic mesenchymal or epithelial cells. As stated above, the currently preferred helper cell line is 293.

[0125] Recently, Racher et al. (1995) disclosed improved methods for culturing 293 cells and propagating adenovirus. In one format, natural cell aggregates are grown by inoculating individual cells into 1 liter siliconized spinner flasks (Techne, Cambridge, UK) containing 100-200 ml of medium. Following stirring at 40 rpm, the cell viability is estimated with trypan blue. In another format, Fibra-Cel microcarriers (Bibby Sterlin, Stone, UK) (5 g/l) is employed as follows. A cell inoculum, resuspended in 5 ml of medium, is added to the carrier (50 ml) in a 250 ml Erlenmeyer flask and left stationary, with occasional agitation, for 1 to 4 h. The medium is then replaced with 50 ml of fresh medium and shaking initiated. For virus production, cells are allowed to grow to about 80% confluence, after which time the medium is replaced (to 25% of the final volume) and adenovirus added at an MOI of 0.05. Cultures are left stationary overnight, following which the volume is increased to 100% and shaking commenced for another 72 h.

[0126] Other than the requirement that the adenovirus vector be replication defective, or at least conditionally defective, the nature of the adenovirus vector is not believed to be crucial to the successful practice of the invention. The adenovirus may be of any of the 42 different known serotypes or subgroups A-F. Adenovirus type 5 of subgroup C is the preferred starting material in order to obtain a conditional replication-defective adenovirus vector for use in the present invention, since Adenovirus type 5 is a human adenovirus about which a great deal of biochemical and genetic information is known, and it has historically been used for most constructions employing adenovirus as a vector.

[0127] As stated above, the typical vector according to the present invention is replication defective and will not have an adenovirus El region. Thus, it will be most convenient to introduce the polynucleotide encoding the gene of interest at the position from which the El-coding sequences have been removed. However, the position of insertion of the construct within the adenovirus sequences is not critical to the invention. The polynucleotide encoding the gene of interest may also be inserted in lieu of the deleted E3 region in E3 replacement vectors as described by Karlsson et al. (1986) or in the E4 region where a helper cell line or helper virus complements the E4 defect.

[0128] Adenovirus is easy to grow and manipulate and exhibits broad host range in vitro and in vivo. This group of viruses can be obtained in high titers, e.g., 10⁹-10¹¹ plaque-forming units per ml, and they are highly infective. The life cycle of adenovirus does not require integration into the host cell genome. The foreign genes delivered by adenovirus vectors are episomal and, therefore, have low genotoxicity to host cells. No side effects have been reported in studies of vaccination with wild-type adenovirus (Couch et al., 1963; Top et al, 1971), demonstrating their safety and therapeutic potential as in vivo gene transfer vectors.

[0129] Adenovirus vectors have been used in eukaryotic gene expression (Levrero et al., 1991; Gomez-Foix et al., 1992) and vaccine development (Grunhaus and Horwitz, 1992; Graham and Prevec, 1992). Recently, animal studies suggested that recombinant adenovirus could be used for gene therapy (Stratford-Perricaudet and Perricaudet, 1991; Stratford-Perricaudet et al., 1990; Rich et al., 1993). Studies in administering recombinant adenovirus to different tissues include trachea instillation (Rosenfeld et aL, 1991; Rosenfeld et al., 1992), muscle injection (Ragot et al., 1993), peripheral intravenous injections (Herz and Gerard, 1993) and stereotactic inoculation into the brain (Le Gal La Salle et al., 1993).

[0130] 2. Retroviruses

[0131] The retroviruses are a group of single-stranded RNA viruses characterized by an ability to convert their RNA to double-stranded DNA in infected cells by a process of reverse-transcription (Coffin, 1990). The resulting DNA then stably integrates into cellular chromosomes as a provirus and directs synthesis of viral proteins. The integration results in the retention of the viral gene sequences in the recipient cell and its descendants. The retroviral genome contains three genes, gag, pol, and env that code for capsid proteins, polymerase enzyme, and envelope components, respectively. A sequence found upstream from the gag gene contains a signal for packaging of the genome into virions. Two long terminal repeat (LTR) sequences are present at the 5′ and 3′ ends of the viral genome. These contain strong promoter and enhancer sequences and are also required for integration in the host cell genome (Coffin, 1990).

[0132] In order to construct a retroviral vector, a nucleic acid encoding one or more oligonucleotide or polynucleotide sequences of interest is inserted into the viral genome in the place of certain viral sequences to produce a virus that is replication-defective. In order to produce virions, a packaging cell line containing the gag, pol, and env genes but without the LTR and packaging components is constructed (Mann et al., 1983). When a recombinant plasmid containing a cDNA, together with the retroviral LTR and packaging sequences is introduced into this cell line (by calcium phosphate precipitation for example), the packaging sequence allows the RNA transcript of the recombinant plasmid to be packaged into viral particles, which are then secreted into the culture media (Nicolas and Rubenstein, 1988; Temin, 1986; Mann et al., 1983). The media containing the recombinant retroviruses is then collected, optionally concentrated, and used for gene transfer. Retroviral vectors are able to infect a broad variety of cell types. However, integration and stable expression require the division of host cells (Paskind et al., 1975).

[0133] A novel approach designed to allow specific targeting of retrovirus vectors was recently developed based on the chemical modification of a retrovirus by the chemical addition of lactose residues to the viral envelope. This modification could permit the specific infection of hepatocytes via sialoglycoprotein receptors.

[0134] A different approach to targeting of recombinant retroviruses was designed in which biotinylated antibodies against a retroviral envelope protein and against a specific cell receptor were used. The antibodies were coupled via the biotin components by using streptavidin (Roux et al., 1989). Using antibodies against major histocompatibility complex class I and class II antigens, they demonstrated the infection of a variety of human cells that bore those surface antigens with an ecotropic virus in vitro (Roux et al, 1989).

[0135] 3. Adeno-Associated Viruses

[0136] AAV (Ridgeway, 1988; Hermonat and Muzycska, 1984) is a parovirus, discovered as a contamination of adenoviral stocks. It is a ubiquitous virus (antibodies are present in 85% of the US human population) that has not been linked to any disease. It is also classified as a dependovirus, because its replications is dependent on the presence of a helper virus, such as adenovirus. Five serotypes have been isolated, of which AAV-2 is the best characterized. AAV has a single-stranded linear DNA that is encapsidated into capsid proteins VP1, VP2 and VP3 to form an icosahedral virion of 20 to 24 nm in diameter (Muzyczka and McLaughlin, 1988).

[0137] The AAV DNA is approximately 4.7 kilobases long. It contains two open reading frames and is flanked by two ITRs (FIG. 2). There are two major genes in the AAV genome: rep and cap. The rep gene codes for proteins responsible for viral replications, whereas cap codes for capsid protein VP1-3. Each ITR forms a T-shaped hairpin structure. These terminal repeats are the only essential cis components of the AAV for chromosomal integration. Therefore, the AAV can be used as a vector with all viral coding sequences removed and replaced by the cassette of genes for delivery. Three viral promoters have been identified and named p5, p19, and p40, according to their map position. Transcription from p5 and p19 results in production of rep proteins, and transcription from p40 produces the capsid proteins (Hermonat and Muzyczka, 1984).

[0138] There are several factors that prompted researchers to study the possibility of using rAAV as an expression vector One is that the requirements for delivering a gene to integrate into the host chromosome are surprisingly few. It is necessary to have the 145-bp ITRs, which are only 6% of the AAV genome. This leaves room in the vector to assemble a 4.5-kb DNA insertion. While this carrying capacity may prevent the AAV from delivering large genes, it is amply suited for delivering the antisense constructs of the present invention.

[0139] AAV is also a good choice of delivery vehicles due to its safety. There is a relatively complicated rescue mechanism: not only wild type adenovirus but also AAV genes are required to mobilize rAAV. Likewise, AAV is not pathogenic and not associated with any disease. The removal of viral coding sequences minimizes immune reactions to viral gene expression, and therefore, rAAV does not evoke an inflammatory response.

[0140] 4. Other Viral Vectors As Expression Constructs Other viral vectors may be employed as expression constructs in the present invention for the delivery of oligonucleotide or polynucleotide sequences to a host cell. Vectors derived from viruses such as vaccinia virus (Ridgeway, 1988; Coupar et al., 1988), lentiviruses, polio viruses and herpes viruses may be employed. They offer several attractive features for various mammalian cells (Friedmann, 1989; Ridgeway, 1988; Coupar et al., 1988; Horwich et al., 1990).

[0141] With the recent recognition of defective hepatitis B viruses, new insight was gained into the structure-function relationship of different viral sequences. In vitro studies showed that the virus could retain the ability for helper-dependent packaging and reverse transcription despite the deletion of up to 80% of its genome (Horwich et al., 1990). This suggested that large portions of the genome could be replaced with foreign genetic material. The hepatotropism and persistence (integration) were particularly attractive properties for liver-directed gene transfer. Chang et al. (1991) introduced the chloramphenicol acetyltransferase (CAT) gene into duck hepatitis B virus genome in the place of the polymerase, surface, and pre-surface coding sequences. It was cotransfected with wild-type virus into an avian hepatoma cell line. Culture media containing high titers of the recombinant virus were used to infect primary duckling hepatocytes. Stable CAT gene expression was detected for at least 24 days after transfection (Chang et al., 1991).

[0142] 5. Non-Viral Vectors

[0143] In order to effect expression of the oligonucleotide or polynucleotide sequences of the present invention, the expression construct must be delivered into a cell. This delivery may be accomplished in vitro, as in laboratory procedures for transforming cells lines, or in vivo or ex vivo, as in the treatment of certain disease states. As described above, one preferred mechanism for delivery is via viral infection where the expression construct is encapsulated in an infectious viral particle.

[0144] Once the expression construct has been delivered into the cell the nucleic acid encoding the desired oligonucleotide or polynucleotide sequences may be positioned and expressed at different sites. In certain embodiments, the nucleic acid encoding the construct may be stably integrated into the genome of the cell. This integration may be in the specific location and orientation via homologous recombination (gene replacement) or it may be integrated in a random, non-specific location (gene augmentation). In yet further embodiments, the nucleic acid may be stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments or “episomes” encode sequences sufficient to permit maintenance and replication independent of or in synchronization with the host cell cycle. How the expression construct is delivered to a cell and where in the cell the nucleic acid remains is dependent on the type of expression construct employed.

[0145] In certain embodiments of the invention, the expression construct comprising one or more oligonucleotide or polynucleotide sequences may simply consist of naked recombinant DNA or plasmids. Transfer of the construct may be performed by any of the methods mentioned above which physically or chemically permeabilize the cell membrane. This is particularly applicable for transfer in vitro but it may be applied to in vivo use as well. Dubensky et al. (1984) successfully injected polyomavirus DNA in the form of calcium phosphate precipitates into liver and spleen of adult and newborn mice demonstrating active viral replication and acute infection. Benvenisty and Reshef (1986) also demonstrated that direct intraperitoneal injection of calcium phosphate-precipitated plasmids results in expression of the transfected genes. It is envisioned that DNA encoding a gene of interest may also be transferred in a similar manner in vivo and express the gene product.

[0146] Another embodiment of the invention for transferring a naked DNA expression construct into cells may involve particle bombardment. This method depends on the ability to accelerate DNA-coated microprojectiles to a high velocity allowing them to pierce cell membranes and enter cells without killing them (Klein et al., 1987). Several devices for accelerating small particles have been developed. One such device relies on a high voltage discharge to generate an electrical current, which in turn provides the motive force (Yang et al., 1990). The microprojectiles used have consisted of biologically inert substances such as tungsten or gold beads.

[0147] Selected organs including the liver, skin, and muscle tissue of rats and mice have been bombarded in vivo (Yang et al., 1990; Zelenin et al., 1991). This may require surgical exposure of the tissue or cells, to eliminate any intervening tissue between the gun and the target organ, i. e. ex vivo treatment. Again, DNA encoding a particular gene may be delivered via this method and still be incorporated by the present invention.

[0148] Antisense Oligonucleotides

[0149] The end result of the flow of genetic information is the synthesis of protein. DNA is transcribed by polymerases into messenger RNA and translated on the ribosome to yield a folded, functional protein. Thus there are several steps along the route where protein synthesis can be inhibited. The native DNA segment coding for a polypeptide described herein, as all such mammalian DNA strands, has two strands: a sense strand and an antisense strand held together by hydrogen bonding. The messenger RNA coding for polypeptide has the same nucleotide sequence as the sense DNA strand except that the DNA thymidine is replaced by uridine. Thus, synthetic antisense nucleotide sequences will bind to a MRNA and inhibit expression of the protein encoded by that mRNA.

[0150] The targeting of antisense oligonucleotides to mRNA is thus one mechanism to shut down protein synthesis, and, consequently, represents a powerful and targeted therapeutic approach. For example, the synthesis of polygalactauronase and the muscarine type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their respective mRNA sequences (U.S. Pat. Nos. 5,739,119 and 5,759,829, each specifically incorporated herein by reference in its entirety). Further, examples of antisense inhibition have been demonstrated with the nuclear protein cyclin, the multiple drug resistance gene (MDG1), ICAM-1, E-selectin, STK-1, striatal GABA_(A) receptor and human EGF (Jaskulski et al., 1988; Vasanthakumar and Ahmed, 1989; Peris et al., 1998; U.S. Pat. Nos. 5,801,154; 5,789,573; 5,718,709 and 5,610,288, each specifically incorporated herein by reference in its entirety). Antisense constructs have also been described that inhibit and can be used to treat a variety of abnormal cellular proliferations, e.g cancer (U.S. Pat. Nos. 5,747,470; 5,591,317 and 5,783,683, each specifically incorporated herein by reference in its entirety).

[0151] Therefore, in exemplary embodiments, the invention provides oligonucleotide sequences that comprise all, or a portion of, any sequence that is capable of specifically binding to polynucleotide sequence described herein, or a complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or derivatives thereof. In a third embodiment, the oligonucleotides are modified DNAs comprising a phosphorothioated modified backbone. In a fourth embodiment, the oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In each case, preferred compositions comprise a sequence region that is complementary, and more preferably substantially-complementary, and even more preferably, completely complementary to one or more portions of polynucleotides disclosed herein.

[0152] Selection of antisense compositions specific for a given gene sequence is based upon analysis of the chosen target sequence (i.e. in these illustrative examples the rat and human sequences) and determination of secondary structure, T_(m), binding energy, relative stability, and antisense compositions were selected based upon their relative inability to form dimers, hairpins, or other secondary structures that would reduce or prohibit specific binding to the target mRNA in a host cell.

[0153] Highly preferred target regions of the mRNA, are those which are at or near the AUG translation initiation codon, and those sequences which were substantially complementary to 5′ regions of the mRNA. These secondary structure analyses and target site selection considerations were performed using v.4 of the OLIGO primer analysis software (Rychlik, 1997) and the BLASTN 2.0.5 algorithm software (Altschul et al., 1997).

[0154] The use of an antisense delivery method employing a short peptide vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a hydrophobic domain derived from the fusion sequence of HIV gp4l and a hydrophilic domain from the nuclear localization sequence of SV40 T-antigen (Morris et al., 1997). It has been demonstrated that several molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%). Further, the interaction with MPG strongly increases both the stability of the oligonucleotide to nuclease and the ability to cross the plasma membrane (Morris et al., 1997).

[0155] Ribozymes

[0156] Although proteins traditionally have been used for catalysis of nucleic acids, another class of macromolecules has emerged as useful in this endeavor. Ribozymes are RNA-protein complexes that cleave nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that possess endonuclease activity (Kim and Cech, 1987; Gerlach et al., 1987; Forster and Symons, 1987). For example, a large number of ribozymes accelerate phosphoester transfer reactions with a high degree of specificity, often cleaving only one of several phosphoesters in an oligonucleotide substrate (Cech et al., 1981; Michel and Westhof, 1990; Reinhold-Hurek and Shub, 1992). This specificity has been attributed to the requirement that the substrate bind via specific base-pairing interactions to the internal guide sequence (“IGS”) of the ribozyme prior to chemical reaction.

[0157] Ribozyme catalysis has primarily been observed as part of sequence-specific cleavage/ligation reactions involving nucleic acids (Joyce, 1989; Cech et al., 1981). For example, U.S. Pat. No. 5,354,855 (specifically incorporated herein by reference) reports that certain ribozymes can act as endonucleases with a sequence specificity greater than that of known ribonucleases and approaching that of the DNA restriction enzymes. Thus, sequence-specific ribozyme-mediated inhibition of gene expression may be particularly suited to therapeutic applications (Scanlon et al., 1991; Sarver et al., 1990). Recently, it was reported that ribozymes elicited genetic changes in some cells lines to which they were applied; the altered genes included the oncogenes H-ras, c-fos and genes of HIV. Most of this work involved the modification of a target mRNA, based on a specific mutant codon that is cleaved by a specific ribozyme.

[0158] Six basic varieties of naturally-occurring enzymatic RNAs are known presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and thus can cleave other RNA molecules) under physiological conditions. In general, enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs through the target binding portion of a enzymatic nucleic acid which is held in close proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through complementary base-pairing, and once bound to the correct site, acts enzymatically to cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and cleaved its RNA target, it is released from that RNA to search for another target and can repeatedly bind and cleave new targets.

[0159] The enzymatic nature of a ribozyme is advantageous over many technologies, such as antisense technology (where a nucleic acid molecule simply binds to a nucleic acid target to block its translation) since the concentration of ribozyme necessary to affect a therapeutic treatment is lower than that of an antisense oligonucleotide. This advantage reflects the ability of the ribozyme to act enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity of inhibition depending not only on the base pairing mechanism of binding to the target RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base-substitutions, near the site of cleavage can completely eliminate catalytic activity of a ribozyme. Similar mismatches in antisense molecules do not prevent their action (Woolf et al., 1992). Thus, the specificity of action of a ribozyme is greater than that of an antisense oligonucleotide binding the same RNA site.

[0160] The enzymatic nucleic acid molecule may be formed in a hammerhead, hairpin, a hepatitis 6 virus, group I intron or RNaseP RNA (in association with an RNA guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are described by Rossi et al. (1992). Examples of hairpin motifs are described by Hampel et al. (Eur. Pat. Appl. Publ. No. EP 0360257), Hampel and Tritz (1989), Hampel et al. (1990) and U.S. Pat. No. 5,631,359 (specifically incorporated herein by reference). An example of the hepatitis δ virus motif is described by Perrotta and Been (1992); an example of the RNaseP motif is described by Guerrier-Takada et al. (1983); Neurospora VS RNA ribozyme motif is described by Collins (Saville and Collins, 1990; Saville and Collins, 1991; Collins and Olive, 1993); and an example of the Group I intron is described in (U.S. Pat. No. 4,987,071, specifically incorporated herein by reference). All that is important in an enzymatic nucleic acid molecule of this invention is that it has a specific substrate binding site which is complementary to one or more of the target gene RNA regions, and that it have nucleotide sequences within or surrounding that substrate binding site which impart an RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be limited to specific motifs mentioned herein.

[0161] In certain embodiments, it may be important to produce enzymatic cleaving agents which exhibit a high degree of specificity for the RNA of a desired target, such as one of the sequences disclosed herein. The enzymatic nucleic acid molecule is preferably targeted to a highly conserved sequence region of a target mRNA. Such enzymatic nucleic acid molecules can be delivered exogenously to specific cells as required. Alternatively, the ribozymes can be expressed from DNA or RNA vectors that are delivered to specific cells.

[0162] Small enzymatic nucleic acid motifs (e.g., of the hammerhead or the hairpin structure) may also be used for exogenous delivery. The simple structure of these molecules increases the ability of the enzymatic nucleic acid to invade targeted regions of the mRNA structure. Alternatively, catalytic RNA molecules can be expressed within cells from eukaryotic promoters (e.g., Scanlon et al., 1991; Kashani-Sabet et al., 1992; Dropulic et al., 1992; Weerasinghe et al., 1991; Ojwang et al., 1992; Chen et al., 1992; Sarver et al., 1990). Those skilled in the art realize that any ribozyme can be expressed in eukaryotic cells from the appropriate DNA vector. The activity of such ribozymes can be augmented by their release from the primary transcript by a second ribozyme (Int. Pat. Appl. Publ. No. WO 93/23569, and Int. Pat. Appl. Publ. No. WO 94/02595, both hereby incorporated by reference; Ohkawa et al., 1992; Taira et al., 1991; and Ventura et aL, 1993).

[0163] Ribozymes may be added directly, or can be complexed with cationic lipids, lipid complexes, packaged within liposomes, or otherwise delivered to target cells. The RNA or RNA complexes can be locally administered to relevant tissues ex vivo, or in vivo through injection, aerosol inhalation, infusion pump or stent, with or without their incorporation in biopolymers.

[0164] Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as described. Such ribozymes can also be optimized for delivery. While specific examples are provided, those in the art will recognize that equivalent RNA targets in other species can be utilized when necessary.

[0165] Hammerhead or hairpin ribozymes may be individually analyzed by computer folding (Jaeger et aL, 1989) to assess whether the ribozyme sequences fold into the appropriate secondary structure. Those ribozymes with unfavorable intramolecular interactions between the binding arms and the catalytic core are eliminated from consideration. Varying binding arm lengths can be chosen to optimize activity. Generally, at least 5 or so bases on each arm are able to bind to, or otherwise interact with, the target RNA.

[0166] Ribozymes of the hammerhead or hairpin motif may be designed to anneal to various sites in the mRNA message, and can be chemically synthesized. The method of synthesis used follows the procedure for normal RNA synthesis as described in Usman etal. (1987) and in Scaringe etal (1990) and makes use of common nucleic acid protecting and coupling groups, such as dimethoxytrityl at the 5′-end, and phosphoramidites at the 3′-end. Average stepwise coupling yields are typically>98%. Hairpin ribozymes may be synthesized in two parts and annealed to reconstruct an active ribozyme (Chowrira and Burke, 1992). Ribozymes may be modified extensively to enhance stability by modification with nuclease resistant groups, for example, 2′-amino, 2′-C-allyl, 2′-flouro, 2′-o-methyl, 2′-H (for a review see e.g., Usman and Cedergren, 1992). Ribozymes may be purified by gel electrophoresis using general methods or by high pressure liquid chromatography and resuspended in water.

[0167] Ribozyme activity can be optimized by altering the length of the ribozyme binding arms, or chemically synthesizing ribozymes with modifications that prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. Publ. No. WO 92/07065; Perrault et al, 1990; Pieken et al., 1991; Usman and Cedergren, 1992; Int. Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 91/03162; Eur. Pat. Appl. Publ. No. 92110298.4; U.S. Pat. No. 5,334,711; and Int. Pat. Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can be made to the sugar moieties of enzymatic RNA molecules), modifications which enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis times and reduce chemical requirements.

[0168] Sullivan etal. (Int. Pat. Appl. Publ. No. WO 94/02595) describes the general methods for delivery of enzymatic RNA molecules. Ribozymes may be administered to cells by a variety of methods known to those familiar to the art, including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. Alternatively, the RNA/vehicle combination may be locally delivered by direct inhalation, by direct injection or by use of a catheter, infusion pump or stent. Other routes of delivery include, but are not limited to, intravascular, intramuscular, subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions of ribozyme delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 94/02595 and Int. Pat. Appl. Publ. No. WO 93/23569, each specifically incorporated herein by reference.

[0169] Another means of accumulating high concentrations of a ribozyme(s) within cells is to incorporate the ribozyme-encoding sequences into a DNA expression vector. Transcription of the ribozyme sequences are driven from a promoter for eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase III (pol III). Transcripts from pol II or pol III promoters will be expressed at high levels in all cells; the levels of a given pol II promoter in a given cell type will depend on the nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. Prokaryotic RNA polymerase promoters may also be used, providing that the prokaryotic RNA polymerase enzyme is expressed in the appropriate cells (Elroy-Stein and Moss, 1990; Gao and Huang, 1993; Lieber et al., 1993; Zhou et al., 1990). Ribozymes expressed from such promoters can function in mammalian cells (e.g. Kashani-Saber etal., 1992; Ojwang et al., 1992; Chen et al., 1992; Yu et al., 1993; L'Huillier et al., 1992; Lisziewicz et al., 1993). Such transcription units can be incorporated into a variety of vectors for introduction into mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as retroviral, semliki forest virus, sindbis virus vectors).

[0170] Ribozymes may be used as diagnostic tools to examine genetic drift and mutations within diseased cells. They can also be used to assess levels of the target RNA molecule. The close relationship between ribozyme activity and the structure of the target RNA allows the detection of mutations in any region of the molecule which alters the base-pairing and three-dimensional structure of the target RNA. By using multiple ribozymes, one may map nucleotide changes which are important to RNA structure and function in vitro, as well as in cells and tissues. Cleavage of target RNAs with ribozymes may be used to inhibit gene expression and define the role (essentially) of specified gene products in the progression of disease. In this manner, other genetic targets may be defined as important mediators of the disease. These studies will lead to better treatment of the disease progression by affording the possibility of combinational therapies (e.g., multiple ribozymes targeted to different genes, ribozymes coupled with known small molecule inhibitors, or intermittent treatment with combinations of ribozymes and/or other chemical or biological molecules). Other in vitro uses of ribozymes are well known in the art, and include detection of the presence of mRNA associated with an IL-5 related condition. Such RNA is detected by determining the presence of a cleavage product after treatment with a ribozyme using standard methodology.

[0171] Peptide Nucleic Acids

[0172] In certain embodiments, the inventors contemplate the use of peptide nucleic acids (PNAs) in the practice of the methods of the invention. PNA is a DNA mimic in which the nucleobases are attached to a pseudopeptide backbone (Good and Nielsen, 1997). PNA is able to be utilized in a number methods that traditionally have used RNA or DNA. Often PNA sequences perform better in techniques than the corresponding RNA or DNA sequences and have utilities that are not inherent to RNA or DNA. A review of PNA including methods of making, characteristics of, and methods of using, is provided by Corey (1997) and is incorporated herein by reference. As such, in certain embodiments, one may prepare PNA sequences that are complementary to one or more portions of the ACE mRNA sequence, and such PNA compositions may be used to regulate, alter, decrease, or reduce the translation of ACE-specific MRNA, and thereby alter the level of ACE activity in a host cell to which such PNA compositions have been administered.

[0173] PNAs have 2-aminoethyl-glycine linkages replacing the normal phosphodiester backbone of DNA (Nielsen et al., 1991; Hanvey et al., 1992; Hyrup and Nielsen, 1996; Neilsen, 1996). This chemistry has three important consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs are neutral molecules; secondly, PNAs are achiral, which avoids the need to develop a stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc (Dueholm et al., 1994) or Fmoc (Thomson et al., 1995) protocols for solid-phase peptide synthesis, although other methods, including a modified Merrifield method, have been used (Christensen et al., 1995).

[0174] PNA monomers or ready-made oligomers are commercially available from PerSeptive Biosystems (Framingham, Mass.). PNA syntheses by either Boc or Fmoc protocols are straightforward using manual or automated protocols (Norton et al., 1995). The manual protocol lends itself to the production of chemically modified PNAs or the simultaneous synthesis of families of closely related PNAs.

[0175] As with peptide synthesis, the success of a particular PNA synthesis will depend on the properties of the chosen sequence. For example, while in theory PNAs can incorporate any combination of nucleotide bases, the presence of adjacent purines can lead to deletions of one or more residues in the product. In expectation of this difficulty, it is suggested that, in producing PNAs with adjacent purines, one should repeat the coupling of residues likely to be added inefficiently. This should be followed by the purification of PNAs by reverse-phase high-pressure liquid chromatography (Norton et al., 1995) providing yields and purity of product similar to those observed during the synthesis of peptides.

[0176] Modifications of PNAs for a given application may be accomplished by coupling amino acids during solid-phase synthesis or by attaching compounds that contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs can be modified after synthesis by coupling to an introduced lysine or cysteine. The ease with which PNAs can be modified facilitates optimization for better solubility or for specific functional requirements. Once synthesized, the identity of PNAs and their derivatives can be confirmed by mass spectrometry. Several studies have made and utilized modifications of PNAs (Norton et al., 1995; Haaima et al., 1996; Stetsenko et al., 1996; Petersen et al., 1995; Ulmann et al., 1996; Koch et al., 1995; Orum et al., 1995; Footer et al., 1996; Griffith et al., 1995; Kremsky et al., 1996; Pardridge et al., 1995; Boffa et al., 1995; Landsdorp et al., 1996; Gambacorti-Passerini et al., 1996; Armitage et al., 1997; Seeger et al., 1997; Ruskowski et aL, 1997). U.S. Pat. No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in diagnostics, modulating protein in organisms, and treatment of conditions susceptible to therapeutics.

[0177] In contrast to DNA and RNA, which contain negatively charged linkages, the PNA backbone is neutral. In spite of this dramatic alteration, PNAs recognize complementary DNA and RNA by Watson-Crick pairing (Egholm et al., 1993), validating the initial modeling by Nielsen et al. (1991). PNAs lack 3′ to 5′ polarity and can bind in either parallel or antiparallel fashion, with the antiparallel mode being preferred (Egholm et al., 1993).

[0178] Hybridization of DNA oligonucleotides to DNA and RNA is destabilized by electrostatic repulsion between the negatively charged phosphate backbones of the complementary strands. By contrast, the absence of charge repulsion in PNA-DNA or PNA-RNA duplexes increases the melting temperature (T_(m)) and reduces the dependence of T_(m) on the concentration of mono- or divalent cations (Nielsen et al., 1991). The enhanced rate and affinity of hybridization are significant because they are responsible for the surprising ability of PNAs to perform strand invasion of complementary sequences within relaxed double-stranded DNA. In addition, the efficient hybridization at inverted repeats suggests that PNAs can recognize secondary structure effectively within double-stranded DNA. Enhanced recognition also occurs with PNAs immobilized on surfaces, and Wang et al. have shown that support-bound PNAs can be used to detect hybridization events (Wang et al., 1996).

[0179] One might expect that tight binding of PNAs to complementary sequences would also increase binding to similar (but not identical) sequences, reducing the sequence specificity of PNA recognition. As with DNA hybridization, however, selective recognition can be achieved by balancing oligomer length and incubation temperature. Moreover, selective hybridization of PNAs is encouraged by PNA-DNA hybridization being less tolerant of base mismatches than DNA-DNA hybridization. For example, a single mismatch within a 16 bp PNA-DNA duplex can reduce the T_(m) by up to 15° C. (Egholm et al., 1993). This high level of discrimination has allowed the development of several PNA-based strategies for the analysis of point mutations (Wang et al., 1996; Carlsson et al., 1996; Thiede et al., 1996; Webb and Hurskainen, 1996; Perry-O'Keefe et al., 1996).

[0180] High-affinity binding provides clear advantages for molecular recognition and the development of new applications for PNAs. For example, 11-13 nucleotide PNAs inhibit the activity of telomerase, a ribonucleo-protein that extends telomere ends using an essential RNA template, while the analogous DNA oligomers do not (Norton et al., 1996).

[0181] Neutral PNAs are more hydrophobic than analogous DNA oligomers, and this can lead to difficulty solubilizing them at neutral pH, especially if the PNAs have a high purine content or if they have the potential to form secondary structures. Their solubility can be enhanced by attaching one or more positive charges to the PNA termini (Nielsen et al., 1991).

[0182] Findings by Allfrey and colleagues suggest that strand invasion will occur spontaneously at sequences within chromosomal DNA (Boffa et al., 1995; Boffa et al., 1996). These studies targeted PNAs to triplet repeats of the nucleotides CAG and used this recognition to purify transcriptionally active DNA (Boffa et al., 1995) and to inhibit transcription (Boffa et al., 1996). This result suggests that if PNAs can be delivered within cells then they will have the potential to be general sequence-specific regulators of gene expression. Studies and reviews concerning the use of PNAs as antisense and anti-gene agents include Nielsen et al. (1993b), Hanvey et al. (1992), and Good and Nielsen (1997). Koppelhus et al. (1997) have used PNAs to inhibit HIV-1 inverse transcription, showing that PNAs may be used for antiviral therapies.

[0183] Methods of characterizing the antisense binding properties of PNAs are discussed in Rose (1993) and Jensen et al. (1997). Rose uses capillary gel electrophoresis to determine binding of PNAs to their complementary oligonucleotide, measuring the relative binding kinetics and stoichiometry. Similar types of measurements were made by Jensen et al. using BIAcore™ technology.

[0184] Other applications of PNAs include use in DNA strand invasion (Nielsen et al., 1991), antisense inhibition (Hanvey et al., 1992), mutational analysis (Orum et al., 1993), enhancers of transcription (Mollegaard et al, 1994), nucleic acid purification (Orum et al., 1995), isolation of transcriptionally active genes (Boffa et al., 1995), blocking of transcription factor binding (Vickers et al., 1995), genome cleavage (Veselkov et al., 1996), biosensors (Wang et al., 1996), in situ hybridization (Thisted et al., 1996), and in a alternative to Southern blotting (Perry-O'Keefe, 1996).

[0185] Polypeptide Compositions

[0186] The present invention, in other aspects, provides polypeptide compositions. Generally, a polypeptide of the invention will be an isolated polypeptide (or an epitope, variant, or active fragment thereof derived from a mammalian species. Preferably, the polypeptide is encoded by a polynucleotide sequence disclosed herein or a sequence which hybridizes under moderately stringent conditions to a polynucleotide sequence disclosed herein. Alternatively, the polypeptide may be defined as a polypeptide which comprises a contiguous amino acid sequence from an amino acid sequence disclosed herein, or which polypeptide comprises an entire amino acid sequence disclosed herein.

[0187] In the present invention, a polypeptide composition is also understood to comprise one or more polypeptides that are immunologically reactive with antibodies generated against a polypeptide of the invention, particularly a polypeptide having the amino acid sequence disclosed in SEQ ID NO:1-38, or to active fragments, or to variants or biological functional equivalents thereof.

[0188] Likewise, a polypeptide composition of the present invention is understood to comprise one or more polypeptides that are capable of eliciting antibodies that are immunologically reactive with one or more polypeptides encoded by one or more contiguous nucleic acid sequences contained in SEQ ID NO:1-38, or to active fragments, or to variants thereof, or to one or more nucleic acid sequences which hybridize to one or more of these sequences under conditions of moderate to high stringency. Particularly illustrative polypeptides include the amino acid sequence disclosed in SEQ ID NO:1-38.

[0189] As used herein, an active fragment of a polypeptide includes a whole or a portion of a polypeptide which is modified by conventional techniques, e.g., mutagenesis, or by addition, deletion, or substitution, but which active fragment exhibits substantially the same structure function, antigenicity, etc., as a polypeptide as described herein.

[0190] In certain illustrative embodiments, the polypeptides of the invention will comprise at least an immunogenic portion of an ovarian tumor protein or a variant thereof, as described herein. As noted above, an “ovarian tumor protein” is a protein that is expressed by ovarian tumor cells. Proteins that are ovarian tumor proteins also react detectably within an immunoassay (such as an ELISA) with antisera from a patient with ovarian cancer. Polypeptides as described herein may be of any length. Additional sequences derived from the native protein and/or heterologous sequences may be present, and such sequences may (but need not) possess further immunogenic or antigenic properties.

[0191] An “immunogenic portion,” as used herein is a portion of a protein that is recognized (i.e., specifically bound) by a B-cell and/or T-cell surface antigen receptor. Such immunogenic portions generally comprise at least 5 amino acid residues, more preferably at least 10, and still more preferably at least 20 amino acid residues of an ovarian tumor protein or a variant thereof. Certain preferred immunogenic portions include peptides in which an N-terminal leader sequence and/or transmembrane domain have been deleted. Other preferred immunogenic portions may contain a small N- and/or C-terminal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), relative to the mature protein.

[0192] Immunogenic portions may generally be identified using well known techniques, such as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 1993) and references cited therein. Such techniques include screening polypeptides for the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or clones. As used herein, antisera and antibodies are “antigen-specific” if they specifically bind to an antigen (i.e., they react with the protein in an ELISA or other immunoassay, and do not react detectably with unrelated proteins). Such antisera and antibodies may be prepared as described herein, and using well known techniques. An immunogenic portion of a native ovarian tumor protein is a portion that reacts with such antisera and/or T-cells at a level that is not substantially less than the reactivity of the full length polypeptide (e.g., in an ELISA and/or T-cell reactivity assay). Such immunogenic portions may react within such assays at a level that is similar to or greater than the reactivity of the full length polypeptide. Such screens may generally be performed using methods well known to those of ordinary skill in the art, such as those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. For example, a polypeptide may be immobilized on a solid support and contacted with patient sera to allow binding of antibodies within the sera to the immobilized polypeptide. Unbound sera may then be removed and bound antibodies detected using, for example, ¹²⁵I-labeled Protein A.

[0193] As noted above, a composition may comprise a variant of a native ovarian tumor protein. A polypeptide “variant,” as used herein, is a polypeptide that differs from a native ovarian tumor protein in one or more substitutions, deletions, additions and/or insertions, such that the immunogenicity of the polypeptide is not substantially diminished. In other words, the ability of a variant to react with antigen-specific antisera may be enhanced or unchanged, relative to the native protein, or may be diminished by less than 50%, and preferably less than 20%, relative to the native protein. Such variants may generally be identified by modifying one of the above polypeptide sequences and evaluating the reactivity of the modified polypeptide with antigen-specific antibodies or antisera as described herein. Preferred variants include those in which one or more portions, such as an N-terminal leader sequence or transmembrane domain, have been removed. Other preferred variants include variants in which a small portion (e.g., 1-30amino acids, preferably 5-15 amino acids) has been removed from the N- and/or C-terminal of the mature protein.

[0194] Polypeptide variants encompassed by the present invention include those exhibiting at least about 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96% 97%, 98%, or 99% or more identity (determined as described above) to the polypeptides disclosed herein.

[0195] Preferably, a variant contains conservative substitutions. A “conservative substitution” is one in which an amino acid is substituted for another amino acid that has similar properties, such that one skilled in the art of peptide chemistry would expect the secondary structure and hydropathic nature of the polypeptide to be substantially unchanged. Amino acid substitutions may generally be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic nature of the residues. For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may represent conservative changes include: (1) ala, pro, gly, glu, asp, gln, asn, ser, thr; (2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his. A variant may also, or alternatively, contain nonconservative changes. In a preferred embodiment, variant polypeptides differ from a native sequence by substitution, deletion or addition of five amino acids or fewer. Variants may also (or alternatively) be modified by, for example, the deletion or addition of amino acids that have minimal influence on the immunogenicity, secondary structure and hydropathic nature of the polypeptide.

[0196] As noted above, polypeptides may comprise a signal (or leader) sequence at the N-terminal end of the protein, which co-translationally or post-translationally directs transfer of the protein. The polypeptide may also be conjugated to a linker or other sequence for ease of synthesis, purification or identification of the polypeptide (e.g.,poly-His), or to enhance binding of the polypeptide to a solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc region.

[0197] Polypeptides may be prepared using any of a variety of well known techniques. Recombinant polypeptides encoded by DNA sequences as described above may be readily prepared from the DNA sequences using any of a variety of expression vectors known to those of ordinary skill in the art. Expression may be achieved in any appropriate host cell that has been transformed or transfected with an expression vector containing a DNA molecule that encodes a recombinant polypeptide. Suitable host cells include prokaryotes, yeast, and higher eukaryotic cells, such as mammalian cells and plant cells. Preferably, the host cells employed are E. coli, yeast or a mammalian cell line such as COS or CHO. Supernatants from suitable host/vector systems which secrete recombinant protein or polypeptide into culture media may be first concentrated using a commercially available filter. Following concentration, the concentrate may be applied to a suitable purification matrix such as an affinity matrix or an ion exchange resin. Finally, one or more reverse phase HPLC steps can be employed to further purify a recombinant polypeptide.

[0198] Portions and other variants having less than about 100 amino acids, and generally less than about 50 amino acids, may also be generated by synthetic means, using techniques well known to those of ordinary skill in the art. For example, such polypeptides may be synthesized using any of the commercially available solid-phase techniques, such as the Merrifield solid-phase synthesis method, where amino acids are sequentially added to a growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 85:2149-2146, 1963. Equipment for automated synthesis of polypeptides is commercially available from suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, Calif.), and may be operated according to the manufacturer's instructions.

[0199] Within certain specific embodiments, a polypeptide may be a fusion protein that comprises multiple polypeptides as described herein, or that comprises at least one polypeptide as described herein and an unrelated sequence, such as a known tumor protein. A fusion partner may, for example, assist in providing T helper epitopes (an immunological fusion partner), preferably T helper epitopes recognized by humans, or may assist in expressing the protein (an expression enhancer) at higher yields than the native recombinant protein. Certain preferred fusion partners are both immunological and expression enhancing fusion partners. Other fusion partners may be selected so as to increase the solubility of the protein or to enable the protein to be targeted to desired intracellular compartments. Still further fusion partners include affinity tags, which facilitate purification of the protein.

[0200] Fusion proteins may generally be prepared using standard techniques, including chemical conjugation. Preferably, a fusion protein is expressed as a recombinant protein, allowing the production of increased levels, relative to a non-fused protein, in an expression system. Briefly, DNA sequences encoding the polypeptide components may be assembled separately, and ligated into an appropriate expression vector. The 3′ end of the DNA sequence encoding one polypeptide component is ligated, with or without a peptide linker, to the 5′ end of a DNA sequence encoding the second polypeptide component so that the reading frames of the sequences are in phase. This permits translation into a single fusion protein that retains the biological activity of both component polypeptides.

[0201] A peptide linker sequence may be employed to separate the first and second polypeptide components by a distance sufficient to ensure that each polypeptide folds into its secondary and tertiary structures. Such a peptide linker sequence is incorporated into the fusion protein using standard techniques well known in the art. Suitable peptide linker sequences may be chosen based on the following factors: (1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a secondary structure that could interact with functional epitopes on the first and second polypeptides; and (3) the lack of hydrophobic or charged residues that might react with the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be used in the linker sequence. Amino acid sequences which may be usefully employed as linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., Proc. Natl. Acad. Sci. USA 83:8258-8262, 1986; U.S. Pat. Nos. 4,935,233 and 4,751,180. The linker sequence may generally be from 1 to about 50 amino acids in length. Linker sequences are not required when the first and second polypeptides have non-essential N-terminal amino acid regions that can be used to separate the functional domains and prevent steric interference.

[0202] The ligated DNA sequences are operably linked to suitable transcriptional or translational regulatory elements. The regulatory elements responsible for expression of DNA are located only 5′ to the DNA sequence encoding the first polypeptides. Similarly, stop codons required to end translation and transcription termination signals are only present 3′ to the DNA sequence encoding the second polypeptide.

[0203] Fusion proteins are also provided. Such proteins comprise a polypeptide as described herein together with an unrelated immunogenic protein. Preferably the immunogenic protein is capable of eliciting a recall response. Examples of such proteins include tetanus, tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl. J. Med., 336:86-91, 1997).

[0204] Within preferred embodiments, an immunological fusion partner is derived from protein D, a surface protein of the gram-negative bacterium Haemophilus influenza B (WO 91/18926). Preferably, a protein D derivative comprises approximately the first third of the protein (e.g., the first N-terminal 100-110 amino acids), and a protein D derivative may be lipidated. Within certain preferred embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to increase the expression level in E. coli (thus functioning as an expression enhancer). The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. Other fusion partners include the non-structural protein from influenzae virus, NSl (hemaglutinin). Typically, the N-terminal 81 amino acids are used, although different fragments that include T-helper epitopes may be used.

[0205] In another embodiment, the immunological fusion partner is the protein known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine amidase known as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 1986). LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to the choline or to some choline analogues such as DEAE. This property has been exploited for the development of E. coli C-LYTA expressing plasmids useful for expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA fragment at the amino terminus has been described (see Biotechnology 10:795-798, 1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated into a fusion protein. A repeat portion is found in the C-terminal region starting at residue 178. A particularly preferred repeat portion incorporates residues 188-305.

[0206] In general, polypeptides (including fusion proteins) and polynucleotides as described herein are isolated. An “isolated” polypeptide or polynucleotide is one that is removed from its original environment. For example, a naturally-occurring protein is isolated if it is separated from some or all of the coexisting materials in the natural system. Preferably, such polypeptides are at least about 90% pure, more preferably at least about 95% pure and most preferably at least about 99% pure. A polynucleotide is considered to be isolated if, for example, it is cloned into a vector that is not a part of the natural environment.

[0207] Binding Agents

[0208] The present invention further provides agents, such as antibodies and antigen-binding fragments thereof, that specifically bind to an ovarian tumor protein. As used herein, an antibody, or antigen-binding fragment thereof, is said to “specifically bind” to an ovarian tumor protein if it reacts at a detectable level (within, for example, an ELISA) with an ovarian tumor protein, and does not react detectably with unrelated proteins under similar conditions. As used herein, “binding” refers to a noncovalent association between two separate molecules such that a complex is formed. The ability to bind may be evaluated by, for example, determining a binding constant for the formation of the complex. The binding constant is the value obtained when the concentration of the complex is divided by the product of the component concentrations. In general, two compounds are said to “bind,” in the context of the present invention, when the binding constant for complex formation exceeds about 10³ L/mol. The binding constant may be determined using methods well known in the art.

[0209] Binding agents may be further capable of differentiating between patients with and without a cancer, such as ovarian cancer, using the representative assays provided herein. In other words, antibodies or other binding agents that bind to an ovarian tumor protein will generate a signal indicating the presence of a cancer in at least about 20% of patients with the disease, and will generate a negative signal indicating the absence of the disease in at least about 90% of individuals without the cancer. To determine whether a binding agent satisfies this requirement, biological samples (e.g., blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a cancer (as determined using standard clinical tests) may be assayed as described herein for the presence of polypeptides that bind to the binding agent. It will be apparent that a statistically significant number of samples with and without the disease should be assayed. Each binding agent should satisfy the above criteria; however, those of ordinary skill in the art will recognize that binding agents may be used in combination to improve sensitivity.

[0210] Any agent that satisfies the above requirements may be a binding agent. For example, a binding agent may be a ribosome, with or without a peptide component, an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In general, antibodies can be produced by cell culture techniques, including the generation of monoclonal antibodies as described herein, or via transfection of antibody genes into suitable bacterial or mammalian cell hosts, in order to allow for the production of recombinant antibodies. In one technique, an immunogen comprising the polypeptide is initially injected into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep or goats). In this step, the polypeptides of this invention may serve as the immunogen without modification. Alternatively, particularly for relatively short polypeptides, a superior immune response may be elicited if the polypeptide is joined to a carrier protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen is injected into the animal host, preferably according to a predetermined schedule incorporating one or more booster immunizations, and the animals are bled periodically. Polyclonal antibodies specific for the polypeptide may then be purified from such antisera by, for example, affinity chromatography using the polypeptide coupled to a suitable solid support.

[0211] Monoclonal antibodies specific for an antigenic polypeptide of interest may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. Immunol. 6:511-519, 1976, and improvements thereto. Briefly, these methods involve the preparation of immortal cell lines capable of producing antibodies having the desired specificity (i.e., reactivity with the polypeptide of interest). Such cell lines may be produced, for example, from spleen cells obtained from an animal immunized as described above. The spleen cells are then immortalized by, for example, fusion with a myeloma cell fusion partner, preferably one that is syngeneic with the immunized animal. A variety of fusion techniques may be employed. For example, the spleen cells and myeloma cells may be combined with a nonionic detergent for a few minutes and then plated at low density on a selective medium that supports the growth of hybrid cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, colonies of hybrids are observed. Single colonies are selected and their culture supernatants tested for binding activity against the polypeptide. Hybridomas having high reactivity and specificity are preferred.

[0212] Monoclonal antibodies may be isolated from the supernatants of growing hybridoma colonies. In addition, various techniques may be employed to enhance the yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from the ascites fluid or the blood. Contaminants may be removed from the antibodies by conventional techniques, such as chromatography, gel filtration, precipitation, and extraction. The polypeptides of this invention may be used in the purification process in, for example, an affinity chromatography step.

[0213] Within certain embodiments, the use of antigen-binding fragments of antibodies may be preferred. Such fragments include Fab fragments, which may be prepared using standard techniques. Briefly, immunoglobulins may be purified from rabbit serum by affinity chromatography on Protein A bead columns (Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988) and digested by papain to yield Fab and Fc fragments. The Fab and Fc fragments may be separated by affinity chromatography on protein A bead columns.

[0214] Monoclonal antibodies of the present invention may be coupled to one or more therapeutic agents. Suitable agents in this regard include radionuclides, differentiation inducers, drugs, toxins, and derivatives thereof. Preferred radionuclides include ⁹⁰Y, ¹²³I, ¹²⁵I, ¹³¹I ¹⁸⁶Re, ¹⁸⁸Re, ²¹²At, and ²¹²Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. Preferred differentiation inducers include phorbol esters and butyric acid. Preferred toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella toxin, and pokeweed antiviral protein.

[0215] A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A direct reaction between an agent and an antibody is possible when each possesses a substituent capable of reacting with the other. For example, a nucleophilic group, such as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl-containing group, such as an anhydride or an acid halide, or with an alkyl group containing a good leaving group (e.g., a halide) on the other.

[0216] Alternatively, it may be desirable to couple a therapeutic agent and an antibody via a linker group. A linker group can function as a spacer to distance an antibody from an agent in order to avoid interference with binding capabilities. A linker group can also serve to increase the chemical reactivity of a substituent on an agent or an antibody, and thus increase the coupling efficiency. An increase in chemical reactivity may also facilitate the use of agents, or functional groups on agents, which otherwise would not be possible.

[0217] It will be evident to those skilled in the art that a variety of bifunctional or polyfunctional reagents, both homo- and hetero-functional (such as those described in the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker group. Coupling may be effected, for example, through amino groups, carboxyl groups, sulfhydryl groups or oxidized carbohydrate residues. There are numerous references describing such methodology, e.g., U.S. Pat. No. 4,671,958, to Rodwell et al.

[0218] Where a therapeutic agent is more potent when free from the antibody portion of the imnuunoconjugates of the present invention, it may be desirable to use a linker group which is cleavable during or upon internalization into a cell. A number of different cleavable linker groups have been described. The mechanisms for the intracellular release of an agent from these linker groups include cleavage by reduction of a disulfide bond (e.g., U.S. Pat. No. 4,489,710, to Spitler), by irradiation of a photolabile bond (e.g., U.S. Pat. No. 4,625,014, to Senter et al.), by hydrolysis of derivatized amino acid side chains (e.g., U.S. Pat. No. 4,638,045, to Kohn et al.), by serum complement-mediated hydrolysis (e.g., U.S. Pat. No. 4,671,958, to Rodwell et aL), and acid-catalyzed hydrolysis (e.g., U.S. Pat. No. 4,569,789, to Blattler et al.).

[0219] It may be desirable to couple more than one agent to an antibody. In one embodiment, multiple molecules of an agent are coupled to one antibody molecule. In another embodiment, more than one type of agent may be coupled to one antibody. Regardless of the particular embodiment, immunoconjugates with more than one agent may be prepared in a variety of ways. For example, more than one agent may be coupled directly to an antibody molecule, or linkers that provide multiple sites for attachment can be used. Alternatively, a carrier can be used.

[0220] A carrier may bear the agents in a variety of ways, including covalent bonding either directly or via a linker group. Suitable carriers include proteins such as albumins (e.g., U.S. Pat. No. 4,507,234, to Kato et al.), peptides and polysaccharides such as aminodextran (e.g., U.S. Pat. No. 4,699,784, to Shih et aL). A carrier may also bear an agent by noncovalent bonding or by encapsulation, such as within a liposome vesicle (e.g., U.S. Pat. Nos. 4,429,008 and 4,873,088). Carriers specific for radionuclide agents include radiohalogenated small molecules and chelating compounds. For example, U.S. Pat. No. 4,735,792 discloses representative radiohalogenated small molecules and their synthesis. A radionuclide chelate may be formed from chelating compounds that include those containing nitrogen and sulfur atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For example, U.S. Pat. No. 4,673,562, to Davison et al. discloses representative chelating compounds and their synthesis.

[0221] A variety of routes of administration for the antibodies and immunoconjugates may be used. Typically, administration will be intravenous, intramuscular, subcutaneous or in the bed of a resected tumor. It will be evident that the precise dose of the antibody/immunoconjugate will vary depending upon the antibody used, the antigen density on the tumor, and the rate of clearance of the antibody.

[0222] T Cells

[0223] Immunotherapeutic compositions may also, or alternatively, comprise T cells specific for an ovarian tumor protein. Such cells may generally be prepared in vitro or ex vivo, using standard procedures. For example, T cells may be isolated from bone marrow, peripheral blood, or a fraction of bone marrow or peripheral blood of a patient, using a commercially available cell separation system, such as the Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine, Calif.; see also U.S. Pat. Nos. 5,240,856; 5,215,926; WO 89/06280; WO 91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or unrelated humans, non-human mammals, cell lines or cultures.

[0224] T cells may be stimulated with an ovarian tumor polypeptide, polynucleotide encoding an ovarian tumor polypeptide and/or an antigen presenting cell (APC) that expresses such a polypeptide. Such stimulation is performed under conditions and for a time sufficient to permit the generation of T cells that are specific for the polypeptide. Preferably, an ovarian tumor polypeptide or polynucleotide is present within a delivery vehicle, such as a microsphere, to facilitate the generation of specific T cells.

[0225] T cells are considered to be specific for an ovarian tumor polypeptide if the T cells specifically proliferate, secrete cytokines or kill target cells coated with the polypeptide or expressing a gene encoding the polypeptide. T cell specificity may be evaluated using any of a variety of standard techniques. For example, within a chromium release assay or proliferation assay, a stimulation index of more than two fold increase in lysis and/or proliferation, compared to negative controls, indicates T cell specificity. Such assays may be performed, for example, as described in Chen et al., Cancer Res. 54:1065-1070, 1994. Alternatively, detection of the proliferation of T cells may be accomplished by a variety of known techniques. For example, T cell proliferation can be detected by measuring an increased rate of DNA synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and measuring the amount of tritiated thymidine incorporated into DNA). Contact with an ovarian tumor polypeptide (100 ng/ml-100 μg/ml, preferably 200 ng/ml-25 μg/ml) for 3-7 days should result in at least a two fold increase in proliferation of the T cells. Contact as described above for 2-3 hours should result in activation of the T cells, as measured using standard cytokine assays in which a two fold increase in the level of cytokine release (e.g., TNF or IFN-γ) is indicative of T cell activation (see Coligan et al., Current Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T cells that have been activated in response to an ovarian tumor polypeptide, polynucleotide or polypeptide-expressing APC may be CD4⁺ and/or CD8⁺. Ovarian tumor protein-specific T cells may be expanded using standard techniques. Within preferred embodiments, the T cells are derived from a patient, a related donor or an unrelated donor, and are administered to the patient following stimulation and expansion.

[0226] For therapeutic purposes, CD4⁺ or CD8⁺ T cells that proliferate in response to an ovarian tumor polypeptide, polynucleotide or APC can be expanded in number either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a variety of ways. For example, the T cells can be re-exposed to an ovarian tumor polypeptide, or a short peptide corresponding to an immunogenic portion of such a polypeptide, with or without the addition of T cell growth factors, such as interleukin-2, and/or stimulator cells that synthesize an ovarian tumor polypeptide. Alternatively, one or more T cells that proliferate in the presence of an ovarian tumor protein can be expanded in number by cloning. Methods for cloning cells are well known in the art, and include limiting dilution.

[0227] Pharmaceutical Compositions

[0228] In additional embodiments, the present invention concerns formulation of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions disclosed herein in pharmaceutically-acceptable solutions for administration to a cell or an animal, either alone, or in combination with one or more other modalities of therapy.

[0229] It will also be understood that, if desired, the nucleic acid segment, RNA, DNA or PNA compositions that express a polypeptide as disclosed herein may be administered in combination with other agents as well, such as, e.g., other proteins or polypeptides or various pharmaceutically-active agents. In fact, there is virtually no limit to other components that may also be included, given that the additional agents do not cause a significant adverse effect upon contact with the target cells or host tissues. The compositions may thus be delivered along with various other agents as required in the particular instance. Such compositions may be purified from host cells or other biological sources, or alternatively may be chemically synthesized as described herein. Likewise, such compositions may further comprise substituted or derivatized RNA or DNA compositions.

[0230] Formulation of pharmaceutically-acceptable excipients and carrier solutions is well-known to those of skill in the art, as is the development of suitable dosing and treatment regimens for using the particular compositions described herein in a variety of treatment regimens, including e.g., oral, parenteral, intravenous, intranasal, and intramuscular administration and formulation.

[0231] 1. Oral Delivery

[0232] In certain applications, the pharmaceutical compositions disclosed herein may be delivered via oral administration to an animal. As such, these compositions may be formulated with an inert diluent or with an assimilable edible carrier, or they may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into tablets, or they may be incorporated directly with the food of the diet.

[0233] The active compounds may even be incorporated with excipients and used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, suspensions, syrups, wafers, and the like (Mathiowitz et al., 1997; Hwang et al., 1998; U.S. Pat. Nos. 5,641,515; 5,580,579 and 5,792,451, each specifically incorporated herein by reference in its entirety). The tablets, troches, pills, capsules and the like may also contain the following: a binder, as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating agent, such as corn starch, potato starch, alginic acid and the like; a lubricant, such as magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may contain, in addition to materials of the above type, a liquid carrier. Various other materials may be present as coatings or to otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or capsules may be coated with shellac, sugar, or both. A syrup of elixir may contain the active compound sucrose as a sweetening agent methyl and propylparabens as preservatives, a dye and flavoring, such as cherry or orange flavor. Of course, any material used in preparing any dosage unit form should be pharmaceutically pure and substantially non-toxic in the amounts employed. In addition, the active compounds may be incorporated into sustained-release preparation and formulations.

[0234] Typically, these formulations may contain at least about 0.1% of the active compound or more, although the percentage of the active ingredient(s) may, of course, be varied and may conveniently be between about 1 or 2% and about 60% or 70% or more of the weight or volume of the total formulation. Naturally, the amount of active compound(s) in each therapeutically useful composition may be prepared is such a way that a suitable dosage will be obtained in any given unit dose of the compound. Factors such as solubility, bioavailability, biological half-life, route of administration, product shelf life, as well as other pharmacological considerations will be contemplated by one skilled in the art of preparing such pharmaceutical formulations, and as such, a variety of dosages and treatment regimens may be desirable.

[0235] For oral administration the compositions of the present invention may alternatively be incorporated with one or more excipients in the form of a mouthwash, dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. For example, a mouthwash may be prepared incorporating the active ingredient in the required amount in an appropriate solvent, such as a sodium borate solution (Dobell's Solution). Alternatively, the active ingredient may be incorporated into an oral solution such as one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a dentifrice, or added in a therapeutically-effective amount to a composition that may include water, binders, abrasives, flavoring agents, foaming agents, and humectants. Alternatively the compositions may be fashioned into a tablet or solution form that may be placed under the tongue or otherwise dissolved in the mouth.

[0236] 2. Injectable Delivery

[0237] In certain circumstances it will be desirable to deliver the pharmaceutical compositions disclosed herein parenterally, intravenously, intramuscularly, or even intraperitoneally as described in U.S. Pat. Nos. 5,543,158; 5,641,515 and 5,399,363 (each specifically incorporated herein by reference in its entirety). Solutions of the active compounds as free base or pharmacologically acceptable salts may be prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of storage and use, these preparations contain a preservative to prevent the growth of microorganisms.

[0238] The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions (U.S. Pat. No. 5,466,468, specifically incorporated herein by reference in its entirety). In all cases the form must be sterile and must be fluid to the extent that easy syringability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms, such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable oils. Proper fluidity may be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. The prevention of the action of microorganisms can be facilitated by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the injectable compositions can be brought about by the use in the compositions of agents delaying absorption, for example, aluminum monostearate and gelatin.

[0239] For parenteral administration in an aqueous solution, for example, the solution should be suitably buffered if necessary and the liquid diluent first rendered isotonic with sufficient saline or glucose. These particular aqueous solutions are especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this connection, a sterile aqueous medium that can be employed will be known to those of skill in the art in light of the present disclosure. For example, one dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, “Remington's Pharmaceutical Sciences” 15th Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will necessarily occur depending on the condition of the subject being treated. The person responsible for administration will, in any event, determine the appropriate dose for the individual subject. Moreover, for human administration, preparations should meet sterility, pyrogenicity, and the general safety and purity standards as required by FDA Office of Biologics standards.

[0240] Sterile injectable solutions are prepared by incorporating the active compounds in the required amount in the appropriate solvent with various of the other ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle which contains the basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.

[0241] The compositions disclosed herein may be formulated in a neutral or salt form. Pharmaceutically-acceptable salts, include the acid addition salts (formed with the free amino groups of the protein) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be administered in a manner compatible with the dosage formulation and in such amount as is therapeutically effective. The formulations are easily administered in a variety of dosage forms such as injectable solutions, drug-release capsules, and the like.

[0242] As used herein, “carrier” includes any and all solvents, dispersion media, vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use of such media and agents for pharmaceutical active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active ingredient, its use in the therapeutic compositions is contemplated. Supplementary active ingredients can also be incorporated into the compositions.

[0243] The phrase “pharmaceutically-acceptable” refers to molecular entities and compositions that do not produce an allergic or similar untoward reaction when administered to a human. The preparation of an aqueous composition that contains a protein as an active ingredient is well understood in the art. Typically, such compositions are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection can also be prepared. The preparation can also be emulsified.

[0244] 3. Nasal Delivery

[0245] In certain embodiments, the pharmaceutical compositions may be delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles. Methods for delivering genes, nucleic acids, and peptide compositions directly to the lungs via nasal aerosol sprays has been described e.g., in U.S. Pat. Nos. 5,756,353 and 5,804,212 (each specifically incorporated herein by reference in its entirety). Likewise, the delivery of drugs using intranasal microparticle resins (Takenaga et al., 1998) and lysophosphatidyl-glycerol compounds (U.S. Pat. No. 5,725,871, specifically incorporated herein by reference in its entirety) are also well-known in the pharmaceutical arts. Likewise, transmucosal drug delivery in the form of a polytetrafluoroetheylene support matrix is described in U.S. Pat. No. 5,780,045 (specifically incorporated herein by reference in its entirety).

[0246] 4. Liposome-, Nanocapsule-, And Microparticle-Mediated Delivery

[0247] In certain embodiments, the inventors contemplate the use of liposomes, nanocapsules, microparticles, microspheres, lipid particles, vesicles, and the like, for the introduction of the compositions of the present invention into suitable host cells. In particular, the compositions of the present invention may be formulated for delivery either encapsulated in a lipid particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like.

[0248] Such formulations may be preferred for the introduction of pharmaceutically-acceptable formulations of the nucleic acids or constructs disclosed herein. The formation and use of liposomes is generally known to those of skill in the art (see for example, Couvreur et al, 1977; Couvreur, 1988; Lasic, 1998; which describes the use of liposomes and nanocapsules in the targeted antibiotic therapy for intracellular bacterial infections and diseases). Recently, liposomes were developed with improved serum stability and circulation half-times (Gabizon and Papahadjopoulos, 1988; Allen and Choun, 1987; U.S. Pat. No. 5,741,516, specifically incorporated herein by reference in its entirety). Further, various methods of liposome and liposome like preparations as potential drug carriers have been reviewed (Takakura, 1998; Chandran et al., 1997; Margalit, 1995; U.S. Pat. Nos. 5,567,434; 5,552,157; 5,565,213; 5,738,868 and 5,795,587, each specifically incorporated herein by reference in its entirety).

[0249] Liposomes have been used successfully with a number of cell types that are normally resistant to transfection by other procedures including T cell suspensions, primary hepatocyte cultures and PC 12 cells (Renneisen et al., 1990; Muller et al., 1990). In addition, liposomes are free of the DNA length constraints that are typical of viral-based delivery systems. Liposomes have been used effectively to introduce genes, drugs (Heath and Martin, 1986; Heath et al., 1986; Balazsovits et al., 1989; Fresta and Puglisi, 1996), radiotherapeutic agents (Pikul et al., 1987), enzymes (Imaizumi et al., 1990a; Imaizumi et aL, 1990b), viruses (Faller and Baltimore, 1984), transcription factors and allosteric effectors (Nicolau and Gersonde, 1979) into a variety of cultured cell lines and animals. In addition, several successful clinical trails examining the effectiveness of liposome-mediated drug delivery have been completed (Lopez-Berestein et al., 1985a; 1985b; Coune, 1988; Sculier et al, 1988). Furthermore, several studies suggest that the use of liposomes is not associated with autoimmune responses, toxicity or gonadal localization after systemic delivery (Mori and Fukatsu, 1992).

[0250] Liposomes are formed from phospholipids that are dispersed in an aqueous medium and spontaneously form multilamellar concentric bilayer vesicles (also termed multilamellar vesicles (MLVs). MLVs generally have diameters of from 25 mn to 4 μm. Sonication of MLVs results in the formation of small unilamellar vesicles (SUVs) with diameters in the range of 200 to 500 Å, containing an aqueous solution in the core.

[0251] Liposomes bear resemblance to cellular membranes and are contemplated for use in connection with the present invention as carriers for the peptide compositions. They are widely suitable as both water- and lipid-soluble substances can be entrapped, ie. in the aqueous spaces and within the bilayer itself, respectively. It is possible that the drug-bearing liposomes may even be employed for site-specific delivery of active agents by selectively modifying the liposomal formulation.

[0252] In addition to the teachings of Couvreur et al. (1977; 1988), the following information may be utilized in generating liposomal formulations. Phospholipids can form a variety of structures other than liposomes when dispersed in water, depending on the molar ratio of lipid to water. At low ratios the liposome is the preferred structure. The physical characteristics of liposomes depend on pH, ionic strength and the presence of divalent cations. Liposomes can show low permeability to ionic and polar substances, but at elevated temperatures undergo a phase transition which markedly alters their permeability. The phase transition involves a change from a closely packed, ordered structure, known as the gel state, to a loosely packed, less-ordered structure, known as the fluid state. This occurs at a characteristic phase-transition temperature and results in an increase in permeability to ions, sugars and drugs.

[0253] In addition to temperature, exposure to proteins can alter the permeability of liposomes. Certain soluble proteins, such as cytochrome c, bind, deform and penetrate the bilayer, thereby causing changes in permeability. Cholesterol inhibits this penetration of proteins, apparently by packing the phospholipids more tightly. It is contemplated that the most useful liposome formations for antibiotic and inhibitor delivery will contain cholesterol.

[0254] The ability to trap solutes varies between different types of liposomes. For example, MLVs are moderately efficient at trapping solutes, but SUVs are extremely inefficient. SUVs offer the advantage of homogeneity and reproducibility in size distribution, however, and a compromise between size and trapping efficiency is offered by large unilamellar vesicles (LUVs). These are prepared by ether evaporation and are three to four times more efficient at solute entrapment than MLVs.

[0255] In addition to liposome characteristics, an important determinant in entrapping compounds is the physicochemical properties of the compound itself. Polar compounds are trapped in the aqueous spaces and nonpolar compounds bind to the lipid bilayer of the vesicle. Polar compounds are released through permeation or when the bilayer is broken, but nonpolar compounds remain affiliated with the bilayer unless it is disrupted by temperature or exposure to lipoproteins. Both types show maximum efflux rates at the phase transition temperature.

[0256] Liposomes interact with cells via four different mechanisms: endocytosis by phagocytic cells of the reticuloendothelial system such as macrophages and neutrophils; adsorption to the cell surface, either by nonspecific weak hydrophobic or electrostatic forces, or by specific interactions with cell-surface components; fusion with the plasma cell membrane by insertion of the lipid bilayer of the liposome into the plasma membrane, with simultaneous release of liposomal contents into the cytoplasm; and by transfer of liposomal lipids to cellular or subcellular membranes, or vice versa, without any association of the liposome contents. It often is difficult to determine which mechanism is operative and more than one may operate at the same time.

[0257] The fate and disposition of intravenously injected liposomes depend on their physical properties, such as size, fluidity, and surface charge. They may persist in tissues for h or days, depending on their composition, and half lives in the blood range from min to several h. Larger liposomes, such as MLVs and LUVs, are taken up rapidly by phagocytic cells of the reticuloendothelial system, but physiology of the circulatory system restrains the exit of such large species at most sites. They can exit only in places where large openings or pores exist in the capillary endothelium, such as the sinusoids of the liver or spleen. Thus, these organs are the predominate site of uptake. On the other hand, SUVs show a broader tissue distribution but still are sequestered highly in the liver and spleen. In general, this in vivo behavior limits the potential targeting of liposomes to only those organs and tissues accessible to their large size. These include the blood, liver, spleen, bone marrow, and lymphoid organs.

[0258] Targeting is generally not a limitation in terms of the present invention. However, should specific targeting be desired, methods are available for this to be accomplished. Antibodies may be used to bind to the liposome surface and to direct the antibody and its drug contents to specific antigenic receptors located on a particular cell-type surface. Carbohydrate determinants (glycoprotein or glycolipid cell-surface components that play a role in cell-cell recognition, interaction and adhesion) may also be used as recognition sites as they have potential in directing liposomes to particular cell types. Mostly, it is contemplated that intravenous injection of liposomal preparations would be used, but other routes of administration are also conceivable.

[0259] Alternatively, the invention provides for pharmaceutically-acceptable nanocapsule formulations of the compositions of the present invention. Nanocapsules can generally entrap compounds in a stable and reproducible way (Henry-Michelland et al., 1987; Quintanar-Guerrero et al., 1998; Douglas et al., 1987). To avoid side effects due to intracellular polymeric overloading, such ultrafine particles (sized around 0.1 μm) should be designed using polymers able to be degraded in vivo. Biodegradable polyalkyl-cyanoacrylate nanoparticles that meet these requirements are contemplated for use in the present invention. Such particles may be are easily made, as described (Couvreur et al., 1980; 1988; zur Muhlen et al., 1998; Zambaux et al. 1998; Pinto-Alphandry et al., 1995 and U.S. Pat. No. 5,145,684, specifically incorporated herein by reference in its entirety).

[0260] Vaccines

[0261] In certain preferred embodiments of the present invention, vaccines are provided. The vaccines will generally comprise one or more pharmaceutical compositions, such as those discussed above, in combination with an immunostimulant. An immunostimulant may be any substance that enhances or potentiates an immune response (antibody and/or cell-mediated) to an exogenous antigen. Examples of immunostimulants include adjuvants, biodegradable microspheres (e.g., polylactic galactide) and liposomes (into which the compound is incorporated; see e.g., Fullerton, U.S. Pat. No. 4,235,877). Vaccine preparation is generally described in, for example, M. F. Powell and M.J. Newman, eds., “Vaccine Design (the subunit and adjuvant approach),” Plenum Press (N.Y., 1995). Pharmaceutical compositions and vaccines within the scope of the present invention may also contain other compounds, which may be biologically active or inactive. For example, one or more immunogenic portions of other tumor antigens may be present, either incorporated into a fusion polypeptide or as a separate compound, within the composition or vaccine.

[0262] Illustrative vaccines may contain DNA encoding one or more of the polypeptides as described above, such that the polypeptide is generated in situ. As noted above, the DNA may be present within any of a variety of delivery systems known to those of ordinary skill in the art, including nucleic acid expression systems, bacteria and viral expression systems. Numerous gene delivery techniques are well known in the art, such as those described by Rolland, Crit. Rev. Therap. Drug Carrier Systems 15:143-198, 1998, and references cited therein. Appropriate nucleic acid expression systems contain the necessary DNA sequences for expression in the patient (such as a suitable promoter and terminating signal). Bacterial delivery systems involve the administration of a bacterium (such as Bacillus-Calmette-Guerrin) that expresses an immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. In a preferred embodiment, the DNA may be introduced using a viral expression system (e.g., vaccinia or other pox virus, retrovirus, or adenovirus), which may involve the use of a non-pathogenic (defective), replication competent virus. Suitable systems are disclosed, for example, in Fisher-Hoch et al., Proc. Natl. Acad. Sci. USA 86:317-321, 1989; Flexner et al., Ann. N.Y Acad. Sci. 569:86-103, 1989; Flexner et al., Vaccine 8:17-21, 1990; U.S. Pat. Nos. 4,603,112, 4,769,330, and 5,017,487; WO 89/01973; U.S. Pat. No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et al., Science 252:431-434, 1991; Kolls et al., Proc. Natl. Acad. Sci. USA 91:215-219, 1994; Kass-Eisler et al., Proc. Natl. Acad. Sci. USA 90:11498-11502, 1993; Guzman et al., Circulation 88:2838-2848, 1993; and Guzman et al., Cir. Res. 73:1202-1207, 1993. Techniques for incorporating DNA into such expression systems are well known to those of ordinary skill in the art. The DNA may also be “naked,” as described, for example, in Ulmer et al., Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691-1692, 1993. The uptake of naked DNA may be increased by coating the DNA onto biodegradable beads, which are efficiently transported into the cells. It will be apparent that a vaccine may comprise both a polynucleotide and a polypeptide component. Such vaccines may provide for an enhanced immune response.

[0263] It will be apparent that a vaccine may contain pharmaceutically acceptable salts of the polynucleotides and polypeptides provided herein. Such salts may be prepared from pharmaceutically acceptable non-toxic bases, including organic bases (e.g., salts of primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g., sodium, potassium, lithium, ammonium, calcium and magnesium salts).

[0264] While any suitable carrier known to those of ordinary skill in the art may be employed in the vaccine compositions of this invention, the type of carrier will vary depending on the mode of administration. Compositions of the present invention may be formulated for any appropriate manner of administration, including for example, topical, oral, nasal, intravenous, intracranial, intraperitoneal, subcutaneous or intramuscular administration. For parenteral administration, such as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, a wax or a buffer. For oral administration, any of the above carriers or a solid carrier, such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose, sucrose, and magnesium carbonate, may be employed. Biodegradable microspheres (e.g., polylactate polyglycolate) may also be employed as carriers for the pharmaceutical compositions of this invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. Pat. Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 5,814,344 and 5,942,252. One may also employ a carrier comprising the particulate-protein complexes described in U.S. Pat. No. 5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte responses in a host.

[0265] Such compositions may also comprise buffers (e.g., neutral buffered saline or phosphate buffered saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a recipient, suspending agents, thickening agents and/or preservatives. Alternatively, compositions of the present invention may be formulated as a lyophilizate. Compounds may also be encapsulated within liposomes using well known technology.

[0266] Any of a variety of immunostimulants may be employed in the vaccines of this invention. For example, an adjuvant may be included. Most adjuvants contain a substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. Suitable adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, Mich.); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, N.J.); AS-2 (SmithKline Beecham, Philadelphia, Pa.); aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2,-7, or -12, may also be used as adjuvants.

[0267] Within the vaccines provided herein, the adjuvant composition is preferably designed to induce an immune response predominantly of the Thl type. High levels of Thl-type cytokines (e.g., IFN-γ, TNFα, IL-2 and IL-12) tend to favor the induction of cell mediated immune responses to an administered antigen. In contrast, high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the induction of humoral immune responses. Following application of a vaccine as provided herein, a patient will support an immune response that includes Th1- and Th2-type responses. Within a preferred embodiment, in which a response is predominantly Th1-type, the level of Th1-type cytokines will increase to a greater extent than the level of Th2-type cytokines. The levels of these cytokines may be readily assessed using standard assays. For a review of the families of cytokines, see Mosmann and Coffman, Ann. Rev. Immunol. 7:145-173, 1989.

[0268] Preferred adjuvants for use in eliciting a predominantly Thl-type response include, for example, a combination of monophosphoryl lipid A, preferably 3-de-O-acylated monophosphoryl lipid A (3D-MPL), together with an aluminum salt. MPL adjuvants are available from Corixa Corporation (Seattle, Wash.; see US Pat. Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a predominantly Thl response. Such oligonucleotides are well known and are described, for example, in WO 96/02555, WO 99/33488 and U.S. Pat. Nos. 6,008,200 and 5,856,462. Immunostimulatory DNA sequences are also described, for example, by Sato et al., Science 273:352, 1996. Another preferred adjuvant is a saponin, preferably QS21 (Aquila Biopharmaceuticals Inc., Framingham, Mass.), which may be used alone or in combination with other adjuvants. For example, an enhanced system involves the combination of a monophosphoryl lipid A and saponin derivative, such as the combination of QS21 and 3D-MPL as described in WO 94/00153, or a less reactogenic composition where the QS21 is quenched with cholesterol, as described in WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and tocopherol. A particularly potent adjuvant formulation involving QS21, 3D-MPL and tocopherol in an oil-in-water emulsion is described in WO 95/17210.

[0269] Other preferred adjuvants include Montanide ISA 720 (Seppic, France), SAF (Chiron, Calif. United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, Belgium), Detox (Corixa, Hamilton, Mont.), RC-529 (Corixa, Hamilton, Mont.) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described in pending U.S. patent application Ser. Nos. 08/853,826 and 09/074,720, the disclosures of which are incorporated herein by reference in their entireties.

[0270] Any vaccine provided herein may be prepared using well known methods that result in a combination of antigen, immune response enhancer and a suitable carrier or excipient. The compositions described herein may be administered as part of a sustained release formulation (i.e., a formulation such as a capsule, sponge or gel (composed of polysaccharides, for example) that effects a slow release of compound following administration). Such formulations may generally be prepared using well known technology (see, e.g., Coombes et al., Vaccine 14:1429-1438, 1996) and administered by, for example, oral, rectal or subcutaneous implantation, or by implantation at the desired target site. Sustained-release formulations may contain a polypeptide, polynucleotide or antibody dispersed in a carrier matrix and/or contained within a reservoir surrounded by a rate controlling membrane.

[0271] Carriers for use within such formulations are biocompatible, and may also be biodegradable; preferably the formulation provides a relatively constant level of active component release. Such carriers include microparticles of poly(lactide-co-glycolide), polyacrylate, latex, starch, cellulose, dextran and the like. Other delayed-release carriers include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer comprising an amphiphilic compound, such as a phospholipid (see e.g., U.S. Pat. No. 5,151,254 and PCT applications WO 94/20078, WO/94/23701 and WO 96/06638). The amount of active compound contained within a sustained release formulation depends upon the site of implantation, the rate and expected duration of release and the nature of the condition to be treated or prevented.

[0272] Any of a variety of delivery vehicles may be employed within pharmaceutical compositions and vaccines to facilitate production of an antigen-specific immune response that targets tumor cells. Delivery vehicles include antigen presenting cells (APCs), such as dendritic cells, macrophages, B cells, monocytes and other cells that may be engineered to be efficient APCs. Such cells may, but need not, be genetically modified to increase the capacity for presenting the antigen, to improve activation and/or maintenance of the T cell response, to have anti-tumor effects per se and/or to be immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs may generally be isolated from any of a variety of biological fluids and organs, including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic or xenogeneic cells.

[0273] Certain preferred embodiments of the present invention use dendritic cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent APCs (Banchereau and Steinman, Nature 392:245-251, 1998) and have been shown to be effective as a physiological adjuvant for eliciting prophylactic or therapeutic antitumor immunity (see Timmerman and Levy, Ann. Rev. Med. 50:507-529, 1999). In general, dendritic cells may be identified based on their typical shape (stellate in situ, with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, process and present antigens with high efficiency and their ability to activate nai've T cell responses. Dendritic cells may, of course, be engineered to express specific cell-surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex vivo, and such modified dendritic cells are contemplated by the present invention. As an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called exosomes) may be used within a vaccine (see Zitvogel et al., Nature Med. 4:594-600, 1998).

[0274] Dendritic cells and progenitors may be obtained from peripheral blood, bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For example, dendritic cells may be differentiated ex vivo by adding a combination of cytokines such as GM-CSF, IL-4, IL-13 and/or TNFα to cultures of monocytes harvested from peripheral blood. Alternatively, CD34 positive cells harvested from peripheral blood, umbilical cord blood or bone marrow may be differentiated into dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFα, CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, maturation and proliferation of dendritic cells.

[0275] Dendritic cells are conveniently categorized as “immature” and “mature” cells, which allows a simple way to discriminate between two well characterized phenotypes. However, this nomenclature should not be construed to exclude all possible intermediate stages of differentiation. Immature dendritic cells are characterized as APC with a high capacity for antigen uptake and processing, which correlates with the high expression of Fcγ receptor and mannose receptor. The mature phenotype is typically characterized by a lower expression of these markers, but a high expression of cell surface molecules responsible for T cell activation such as class I and class II MHC, adhesion molecules (e.g., CD54 and CDl 1) and costimulatory molecules (e.g., CD40, CD80, CD86 and 4-1BB).

[0276] APCs may generally be transfected with a polynucleotide encoding an ovarian tumor protein (or portion or other variant thereof) such that the ovarian tumor polypeptide, or an immunogenic portion thereof, is expressed on the cell surface. Such transfection may take place ex vivo, and a composition or vaccine comprising such transfected cells may then be used for therapeutic purposes, as described herein. Alternatively, a gene delivery vehicle that targets a dendritic or other antigen presenting cell may be administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex vivo transfection of dendritic cells, for example, may generally be performed using any methods known in the art, such as those described in WO 97/24447, or the gene gun approach described by Mahvi et al., Immunology and cell Biology 75:456-460, 1997. Antigen loading of dendritic cells may be achieved by incubating dendritic cells or progenitor cells with the ovarian tumor polypeptide, DNA (naked or within a plasmid vector) or RNA; or with antigen-expressing recombinant bacterium or viruses (e.g., vaccinia, fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be covalently conjugated to an immunological partner that provides T cell help (e.g., a carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated immunological partner, separately or in the presence of the polypeptide.

[0277] Vaccines and pharmaceutical compositions may be presented in unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers are preferably hermetically sealed to preserve sterility of the formulation until use. In general, formulations may be stored as suspensions, solutions or emulsions in oily or aqueous vehicles. Alternatively, a vaccine or pharmaceutical composition may be stored in a freeze-dried condition requiring only the addition of a sterile liquid carrier immediately prior to use.

[0278] Cancer Therapy

[0279] In further aspects of the present invention, the compositions described herein may be used for immunotherapy of cancer, such as ovarian cancer. Within such methods, pharmaceutical compositions and vaccines are typically administered to a patient. As used herein, a “patient” refers to any warm-blooded animal, preferably a human. A patient may or may not be afflicted with cancer. Accordingly, the above pharmaceutical compositions and vaccines may be used to prevent the development of a cancer or to treat a patient afflicted with a cancer. A cancer may be diagnosed using criteria generally accepted in the art, including the presence of a malignant tumor. Pharmaceutical compositions and vaccines may be administered either prior to or following surgical removal of primary tumors and/or treatment such as administration of radiotherapy or conventional chemotherapeutic drugs. Administration may be by any suitable method, including administration by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, anal, vaginal, topical and oral routes.

[0280] Within certain embodiments, immunotherapy may be active immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous host immune system to react against tumors with the administration of immune response-modifying agents (such as polypeptides and polynucleotides as provided herein).

[0281] Within other embodiments, immunotherapy may be passive immunotherapy, in which treatment involves the delivery of agents with established tumor-immune reactivity (such as effector cells or antibodies) that can directly or indirectly mediate antitumor effects and does not necessarily depend on an intact host immune system. Examples of effector cells include T cells as discussed above, T lymphocytes (such as CD8⁺ cytotoxic T lymphocytes and CD4⁺ T-helper tumor-infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine-activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and macrophages) expressing a polypeptide provided herein. T cell receptors and antibody receptors specific for the polypeptides recited herein may be cloned, expressed and transferred into other vectors or effector cells for adoptive immunotherapy. The polypeptides provided herein may also be used to generate antibodies or anti-idiotypic antibodies (as described above and in U.S. Pat. No. 4,918,164) for passive immunotherapy.

[0282] Effector cells may generally be obtained in sufficient quantities for adoptive immunotherapy by growth in vitro, as described herein. Culture conditions for expanding single antigen-specific effector cells to several billion in number with retention of antigen recognition in vivo are well known in the art. Such in vitro culture conditions typically use intermittent stimulation with antigen, often in the presence of cytokines (such as IL-2) and non-dividing feeder cells. As noted above, immunoreactive polypeptides as provided herein may be used to rapidly expand antigen-specific T cell cultures in order to generate a sufficient number of cells for immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides or transfected with one or more polynucleotides using standard techniques well known in the art. For example, antigen-presenting cells can be transfected with a polynucleotide having a promoter appropriate for increasing expression in a recombinant virus or other expression system. Cultured effector cells for use in therapy must be able to grow and distribute widely, and to survive long term in vivo. Studies have shown that cultured effector cells can be induced to grow in vivo and to survive long term in substantial numbers by repeated stimulation with antigen supplemented with IL-2 (see, for example, Cheever et al., Immunological Reviews 157:177, 1997).

[0283] Alternatively, a vector expressing a polypeptide recited herein may be introduced into antigen presenting cells taken from a patient and clonally propagated ex vivo for transplant back into the same patient. Transfected cells may be reintroduced into the patient using any means known in the art, preferably in sterile form by intravenous, intracavitary, intraperitoneal or intratumor administration.

[0284] Routes and frequency of administration of the therapeutic compositions described herein, as well as dosage, will vary from individual to individual, and may be readily established using standard techniques. In general, the pharmaceutical compositions and vaccines may be administered by injection (e.g., intracutaneous, intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Preferably, between 1 and 10 doses may be administered over a 52 week period. Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations may be given periodically thereafter. Alternate protocols may be appropriate for individual patients. A suitable dose is an amount of a compound that, when administered as described above, is capable of promoting an anti-tumor immune response, and is at least 10-50% above the basal (i.e., untreated) level. Such response can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine-dependent generation of cytolytic effector cells capable of killing the patient's tumor cells in vitro. Such vaccines should also be capable of causing an immune response that leads to an improved clinical outcome (e.g., more frequent remissions, complete or partial or longer disease-free survival) in vaccinated patients as compared to non-vaccinated patients. In general, for pharmaceutical compositions and vaccines comprising one or more polypeptides, the amount of each polypeptide present in a dose ranges from about 25 μg to 5 mg per kg of host. Suitable dose sizes will vary with the size of the patient, but will typically range from about 0.1 mL to about 5 mL.

[0285] In general, an appropriate dosage and treatment regimen provides the active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic benefit. Such a response can be monitored by establishing an improved clinical outcome (e.g., more frequent remissions, complete or partial, or longer disease-free survival) in treated patients as compared to non-treated patients. Increases in preexisting immune responses to an ovarian tumor protein generally correlate with an improved clinical outcome. Such immune responses may generally be evaluated using standard proliferation, cytotoxicity or cytokine assays, which may be performed using samples obtained from a patient before and after treatment.

[0286] Cancer Detection And Diagnosis

[0287] In general, a cancer may be detected in a patient based on the presence of one or more ovarian tumor proteins and/or polynucleotides encoding such proteins in a biological sample (for example, blood, sera, sputum urine and/or tumor biopsies) obtained from the patient. In other words, such proteins may be used as markers to indicate the presence or absence of a cancer such as ovarian cancer. In addition, such proteins may be useful for the detection of other cancers. The binding agents provided herein generally permit detection of the level of antigen that binds to the agent in the biological sample. Polynucleotide primers and probes may be used to detect the level of mRNA encoding a tumor protein, which is also indicative of the presence or absence of a cancer. In general, an ovarian tumor sequence should be present at a level that is at least three fold higher in tumor tissue than in normal tissue

[0288] There are a variety of assay formats known to those of ordinary skill in the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In general, the presence or absence of a cancer in a patient may be determined by (a) contacting a biological sample obtained from a patient with a binding agent; (b) detecting in the sample a level of polypeptide that binds to the binding agent; and (c) comparing the level of polypeptide with a predetermined cut-off value.

[0289] In a preferred embodiment, the assay involves the use of binding agent immobilized on a solid support to bind to and remove the polypeptide from the remainder of the sample. The bound polypeptide may then be detected using a detection reagent that contains a reporter group and specifically binds to the binding agent/polypeptide complex. Such detection reagents may comprise, for example, a binding agent that specifically binds to the polypeptide or an antibody or other agent that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G, protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a polypeptide is labeled with a reporter group and allowed to bind to the immobilized binding agent after incubation of the binding agent with the sample. The extent to which components of the sample inhibit the binding of the labeled polypeptide to the binding agent is indicative of the reactivity of the sample with the immobilized binding agent. Suitable polypeptides for use within such assays include full length ovarian tumor proteins and portions thereof to which the binding agent binds, as described above.

[0290] The solid support may be any material known to those of ordinary skill in the art to which the tumor protein may be attached. For example, the solid support may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a plastic material such as polystyrene or polyvinylchloride. The support may also be a magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. Pat. No. 5,359,681. The binding agent may be immobilized on the solid support using a variety of techniques known to those of skill in the art, which are amply described in the patent and scientific literature. In the context of the present invention, the term “immobilization” refers to both noncovalent association, such as adsorption, and covalent attachment (which may be a direct linkage between the agent and functional groups on the support or may be a linkage by way of a cross-linking agent). Immobilization by adsorption to a well in a microtiter plate or to a membrane is preferred. In such cases, adsorption may be achieved by contacting the binding agent, in a suitable buffer, with the solid support for a suitable amount of time. The contact time varies with temperature, but is typically between about 1 hour and about 1 day. In general, contacting a well of a plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 10 μg, and preferably about 100 ng to about 1 μg, is sufficient to immobilize an adequate amount of binding agent.

[0291] Covalent attachment of binding agent to a solid support may generally be achieved by first reacting the support with a bifunctional reagent that will react with both the support and a functional group, such as a hydroxyl or amino group, on the binding agent. For example, the binding agent may be covalently attached to supports having an appropriate polymer coating using benzoquinone or by condensation of an aldehyde group on the support with an amine and an active hydrogen on the binding partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at A12-A13).

[0292] In certain embodiments, the assay is a two-antibody sandwich assay. This assay may be performed by first contacting an antibody that has been immobilized on a solid support, commonly the well of a microtiter plate, with the sample, such that polypeptides within the sample are allowed to bind to the immobilized antibody. Unbound sample is then removed from the immobilized polypeptide-antibody complexes and a detection reagent (preferably a second antibody capable of binding to a different site on the polypeptide) containing a reporter group is added. The amount of detection reagent that remains bound to the solid support is then determined using a method appropriate for the specific reporter group.

[0293] More specifically, once the antibody is immobilized on the support as described above, the remaining protein binding sites on the support are typically blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as bovine serum albumin or Tween 20TM (Sigma Chemical Co., St. Louis, Mo.). The immobilized antibody is then incubated with the sample, and polypeptide is allowed to bind to the antibody. The sample may be diluted with a suitable diluent, such as phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact time (i.e., incubation time) is a period of time that is sufficient to detect the presence of polypeptide within a sample obtained from an individual with ovarian cancer. Preferably, the contact time is sufficient to achieve a level of binding that is at least about 95% of that achieved at equilibrium between bound and unbound polypeptide. Those of ordinary skill in the art will recognize that the time necessary to achieve equilibrium may be readily determined by assaying the level of binding that occurs over a period of time. At room temperature, an incubation time of about 30 minutes is generally sufficient.

[0294] Unbound sample may then be removed by washing the solid support with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second antibody, which contains a reporter group, may then be added to the solid support. Preferred reporter groups include those groups recited above.

[0295] The detection reagent is then incubated with the immobilized antibody-polypeptide complex for an amount of time sufficient to detect the bound polypeptide. An appropriate amount of time may generally be determined by assaying the level of binding that occurs over a period of time. Unbound detection reagent is then removed and bound detection reagent is detected using the reporter group. The method employed for detecting the reporter group depends upon the nature of the reporter group. For radioactive groups, scintillation counting or autoradiographic methods are generally appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups and fluorescent groups. Biotin may be detected using avidin, coupled to a different reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme reporter groups may generally be detected by the addition of substrate (generally for a specific period of time), followed by spectroscopic or other analysis of the reaction products.

[0296] To determine the presence or absence of a cancer, such as ovarian cancer, the signal detected from the reporter group that remains bound to the solid support is generally compared to a signal that corresponds to a predetermined cut-off value. In one preferred embodiment, the cut-off value for the detection of a cancer is the average mean signal obtained when the immobilized antibody is incubated with samples from patients without the cancer. In general, a sample generating a signal that is three standard deviations above the predetermined cut-off value is considered positive for the cancer. In an alternate preferred embodiment, the cut-off value is determined using a Receiver Operator Curve, according to the method of Sackett et al., Clinical Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot of pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity) that correspond to each possible cut-off value for the diagnostic test result. The cut-off value on the plot that is the closest to the upper left-hand comer (i.e., the value that encloses the largest area) is the most accurate cut-off value, and a sample generating a signal that is higher than the cut-off value determined by this method may be considered positive. Alternatively, the cut-off value may be shifted to the left along the plot, to minimize the false positive rate, or to the right, to minimize the false negative rate. In general, a sample generating a signal that is higher than the cut-off value determined by this method is considered positive for a cancer.

[0297] In a related embodiment, the assay is performed in a flow-through or strip test format, wherein the binding agent is immobilized on a membrane, such as nitrocellulose. In the flow-through test, polypeptides within the sample bind to the immobilized binding agent as the sample passes through the membrane. A second, labeled binding agent then binds to the binding agent-polypeptide complex as a solution containing the second binding agent flows through the membrane. The detection of bound second binding agent may then be performed as described above. In the strip test format, one end of the membrane to which binding agent is bound is immersed in a solution containing the sample. The sample migrates along the membrane through a region containing second binding agent and to the area of immobilized binding agent. Concentration of second binding agent at the area of immobilized antibody indicates the presence of a cancer. Typically, the concentration of second binding agent at that site generates a pattern, such as a line, that can be read visually. The absence of such a pattern indicates a negative result. In general, the amount of binding agent immobilized on the membrane is selected to generate a visually discernible pattern when the biological sample contains a level of polypeptide that would be sufficient to generate a positive signal in the two-antibody sandwich assay, in the format discussed above. Preferred binding agents for use in such assays are antibodies and antigen-binding fragments thereof. Preferably, the amount of antibody immobilized on the membrane ranges from about 25 ng to about 1 μg, and more preferably from about 50 ng to about 500 ng. Such tests can typically be performed with a very small amount of biological sample.

[0298] Of course, numerous other assay protocols exist that are suitable for use with the tumor proteins or binding agents of the present invention. The above descriptions are intended to be exemplary only. For example, it will be apparent to those of ordinary skill in the art that the above protocols may be readily modified to use ovarian tumor polypeptides to detect antibodies that bind to such polypeptides in a biological sample. The detection of such ovarian tumor protein specific antibodies may correlate with the presence of a cancer.

[0299] A cancer may also, or alternatively, be detected based on the presence of T cells that specifically react with an ovarian tumor protein in a biological sample. Within certain methods, a biological sample comprising CD4⁺ and/or CD8⁺ T cells isolated from a patient is incubated with an ovarian tumor polypeptide, a polynucleotide encoding such a polypeptide and/or an APC that expresses at least an immunogenic portion of such a polypeptide, and the presence or absence of specific activation of the T cells is detected. Suitable biological samples include, but are not limited to, isolated T cells. For example, T cells may be isolated from a patient by routine techniques (such as by Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T cells may be incubated in vitro for 2-9 days (typically 4 days) at 37° C. with polypeptide (e.g., 5-25 μg/ml). It may be desirable to incubate another aliquot of a T cell sample in the absence of an ovarian tumor polypeptide to serve as a control. For CD4⁺ T cells, activation is preferably detected by evaluating proliferation of the T cells. For CD8⁺ T cells, activation is preferably detected by evaluating cytolytic activity. A level of proliferation that is at least two fold greater and/or a level of cytolytic activity that is at least 20% greater than in disease-free patients indicates the presence of a cancer in the patient.

[0300] As noted above, a cancer may also, or alternatively, be detected based on the level of mRNA encoding an ovarian tumor protein in a biological sample. For example, at least two oligonucleotide primers may be employed in a polymerase chain reaction (PCR) based assay to amplify a portion of an ovarian tumor cDNA derived from a biological sample, wherein at least one of the oligonucleotide primers is specific for (i.e., hybridizes to) a polynucleotide encoding the ovarian tumor protein. The amplified cDNA is then separated and detected using techniques well known in the art, such as gel electrophoresis. Similarly, oligonucleotide probes that specifically hybridize to a polynucleotide encoding an ovarian tumor protein may be used in a hybridization assay to detect the presence of polynucleotide encoding the tumor protein in a biological sample.

[0301] To permit hybridization under assay conditions, oligonucleotide primers and probes should comprise an oligonucleotide sequence that has at least about 60%, preferably at least about 75% and more preferably at least about 90%, identity to a portion of a polynucleotide encoding an ovarian tumor protein that is at least 10 nucleotides, and preferably at least 20 nucleotides, in length. Preferably, oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a polypeptide described herein under moderately stringent conditions, as defined above. Oligonucleotide primers and/or probes which may be usefully employed in the diagnostic methods described herein preferably are at least 10-40 nucleotides in length. In a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule having a sequence recited in SEQ ID NO:1-38. Techniques for both PCR based assays and hybridization assays are well known in the art (see, for example, Mullis et al., Cold Spring Harbor Symp. Quant. Biol., 51:263, 1987; Erlich ed., PCR Technology, Stockton Press, N.Y., 1989).

[0302] One preferred assay employs RT-PCR, in which PCR is applied in conjunction with reverse transcription. Typically, RNA is extracted from a biological sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. PCR amplification using at least one specific primer generates a cDNA molecule, which may be separated and visualized using, for example, gel electrophoresis. Amplification may be performed on biological samples taken from a test patient and from an individual who is not afflicted with a cancer. The amplification reaction may be performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold or greater increase in expression in several dilutions of the test patient sample as compared to the same dilutions of the non-cancerous sample is typically considered positive.

[0303] In another embodiment, the compositions described herein may be used as markers for the progression of cancer. In this embodiment, assays as described above for the diagnosis of a cancer may be performed over time, and the change in the level of reactive polypeptide(s) or polynucleotide(s) evaluated. For example, the assays may be performed every 24-72 hours for a period of 6 months to 1 year, and thereafter performed as needed. In general, a cancer is progressing in those patients in whom the level of polypeptide or polynucleotide detected increases over time. In contrast, the cancer is not progressing when the level of reactive polypeptide or polynucleotide either remains constant or decreases with time.

[0304] Certain in vivo diagnostic assays may be performed directly on a tumor. One such assay involves contacting tumor cells with a binding agent. The bound binding agent may then be detected directly or indirectly via a reporter group. Such binding agents may also be used in histological applications. Alternatively, polynucleotide probes may be used within such applications.

[0305] As noted above, to improve sensitivity, multiple ovarian tumor protein markers may be assayed within a given sample. It will be apparent that binding agents specific for different proteins provided herein may be combined within a single assay. Further, multiple primers or probes may be used concurrently. The selection of tumor protein markers may be based on routine experiments to determine combinations that results in optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided herein may be combined with assays for other known tumor antigens.

[0306] Diagnostic Kits

[0307] The present invention further provides kits for use within any of the above diagnostic methods. Such kits typically comprise two or more components necessary for performing a diagnostic assay. Components may be compounds, reagents, containers and/or equipment. For example, one container within a kit may contain a monoclonal antibody or fragment thereof that specifically binds to an ovarian tumor protein. Such antibodies or fragments may be provided attached to a support material, as described above. One or more additional containers may enclose elements, such as reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain a detection reagent as described above that contains a reporter group suitable for direct or indirect detection of antibody binding.

[0308] Alternatively, a kit may be designed to detect the level of mRNA encoding an ovarian tumor protein in a biological sample. Such kits generally comprise at least one oligonucleotide probe or primer, as described above, that hybridizes to a polynucleotide encoding an ovarian tumor protein. Such an oligonucleotide may be used, for example, within a PCR or hybridization assay. Additional components that may be present within such kits include a second oligonucleotide and/or a diagnostic reagent or container to facilitate the detection of a polynucleotide encoding an ovarian tumor protein.

[0309] The following Examples are offered by way of illustration and not by way of limitation.

EXAMPLES Example 1 Identification Of cDNAs Encoding Ovarian Tumor Antigens

[0310] SEQ ID NO: 1-38 disclose sequences obtained through the construction of cDNA libraries from ovarian tumor tissue. All sequences are believed to be novel in that they do not share significant identity with sequences present in the GenBank non-redundant or the GeneSeq DNA databases.

Example 2 Expression Cloning Of Ovarian Tumor Antigens Using Antisera To Secreted Ovarian Tumor Cell Line Proteins

[0311] This Example illustrates the identification of cDNA molecules encoding ovarian tumor proteins. A cDNA expression library was created in Lambda Zap Express (Stratagene) using the ovarian cell line tumor ES-2. Rabbit anti-sera was generated by immunizations with secreted/shed proteins from the ES-2 cell line culture. This antisera was used for screening, after reactivity to E. coli proteins and human GAPDH was removed by pre-adsorption of the sera with these proteins. Positive clones from this screening were purified, and characterized by partial DNA sequencing and database searching. The sequences are disclosed in SEQ ID NOs:39-48.

Example 3 Identification Of Additional cDNAs Encoding Ovarian Tumor Antigens

[0312] This Example illustrates the identification of cDNA molecules encoding ovarian tumor proteins. The sequences were obtained through the construction of cDNA libraries from ovarian tissue sources. All sequences are believed to be novel in that they do not share significant identity with sequences present in the GenBank non-redundant or the GeneSeq DNA databases. The sequences are disclosed in SEQ ID NOs:49-87.

Example 4 Identification Of cDNAs Encoding Ovarian Tumor Amtogems Using A Subtrated cDNA Library

[0313] This Example illustrates the identification of cDNA molecules encoding ovarian tumor proteins. A PCR subtracted cDNA expression library was constructed from a human ovary tumor line (OTL 298-95) in order to clone the antigen expressed by this tumor that was recognized by a human CD8+ T-cell clone. Recognition of the ovarian tumor by this T-cell clone resulted in the activation of this T-cell clone as demonstrated by specific IFN-gamma release.

[0314] The generation of a subtracted expression library from a human ovary tumor line provides an enriched source of cDNAs overexpressed in ovary tumor and normal tissue. One of the strengths of the PCR subtraction methodology lies in the normalization of differentially expressed cDNAs, such that the recovery of rare transcripts is facilitated. Thus, such an expression library will contain a unique distribution of cDNAs that would otherwise be unobtainable by conventional subtraction or full-length library approaches.

[0315] The library disclosed herein was constructed to facilitate the cloning and identification of the antigen expressed by a human ovarian tumor line (OTL 298-95) that is recognized by human CD8+ T-cell clone 31. The antigen is specific to ovarian tumors. Moreover, the antigen is capable of eliciting an immune response as demonstrated by the existence of cytotoxic CD8+ T cells that recognize the antigen. Furthermore, this antigen is shared in that certain allogeneic ovarian tumor lines were also recognized by this T cell clone when transduced with the restricting HLA Class I allele.

[0316] Poly A tester mRNA was prepared from a human ovary tumor line (OTL 298-95). Driver RNA was isolated from a human mammary epithelial cell line (CHEM2/CHMEC1). This line was chosen as it is derived from normal epithelium, in contrast to OTL 298-95 which is tumor epithelium, and represents a cell line that is not recognized by the OTL 298-95 reactive T-cells. Taken together, this driver cell line should not subtract away the gene of interest. The subsequent steps involving cDNA synthesis, hybridizations, and PCR amplifications were performed according to Clontech's user manual (PCR-Select cDNA Subtraction; incorporated herein by reference). Modifications to the protocol were made at the cDNA restriction step and at the second amplification step. Two different pools of cDNA were subtracted with different restriction enzymes: pool 1 was restricted with MscI, PvuII, Stul and DraI, and pool 2 was restricted with RsaI. Each restriction set was treated as a separate library to ensure that the final mixed library contained overlapping fragments. Thus, the epitope recognized by the T-cells should be represented on a fragment within the library and not destroyed by the presence of a single restriction site within it. The tester to driver ratio during the hybridization was as described in the protocol. In contrast to a deeper subtraction, this narrow subtraction should result in normalization without the removal of some marginal differentially expressed genes.

[0317] The second amplification step utilized primers that were modified from those normally used. Nested PCR primer 1 was engineered to contain a cleavable Not I site (5′ATAAGAATTCGAGCGGCCGCCCGGGCAGGT-3′ (SEQ ID NO: 170)). Three nested PCR primers 2R were engineered to contain a cleavable EcoRI site that was in one of three frames (+1:5′-CCGGAATTCAGCGTGGTCGCGGCCGAGGT-3′ (SEQ ID NO:171); +2:5′-CCGGAATTCCAGCGTGGTCGCGGCCGAGGT-3′ (SEQ ID NO:172); +3:5′-CCGGAATTCACAGCGTGGTCGCGGCCGAGGT-3′ (SEQ ID NO: 173)). Thus, secondary amplification with primer 1-(EcoR I) and an equal mix of primers 2R-(Not I)+1/+2/+3 resulted in products that, when cleaved with EcoR I and Not I, could be ligated directionally into a retroviral vector (pBIBKMS) cleaved with Not I and EcoR I. This resulted in the PCR subtracted and amplified fragments being represented in-frame (1 in 3 chance) somewhere within the library. Due to the mechanics of the subtraction, only 50% of fragments will be in the correct orientation. In the final library, 17% of members can be expected to encode a functional insert that is translated (14% by actual characterization). However, this represented greater than 10,000 primary clones and, even with redundancy of members, represents a complexity that can readily be screened for expression of the antigenic epitope recognized by a T-cell clone.

[0318] Disclosed in SEQ ID NOs:88-168 are 81 associated DNA sequences from the OTL 298-295 PCR subtracted retroviral expression library.

Example 5 Identification Of cDNAs Encoding Ovarian Tumor Proteins

[0319] This Example illustrates the identification of cDNA molecules encoding ovarian tumor proteins. cDNAs were obtained through the screening of an expression cDNA library derived from a cell line grown out of a human ovarian cancer tumor, as described in detail below. The library was probed with serum from a rabbit immunized with membrane material from the same cell line used for construction of the library. The expressed polypeptides encoded by the cDNAs described herein were selected based on their ability to bind the rabbit immunoglobulin, as described below.

[0320] For the membrane preparation, human ovarian cancer tumor cells were pelleted and homogenized with a Dounce Homogenizer in 250 mM sucrose, 10 mM HEPES, 1 mM EDTA, and one complete protease inhibitor tablet (Roche, Basel, Switzerland), at pH 7.4. The homogenized cells were pelleted at 800×g to remove cell debris and then at 8000×g to remove organelles. The remaining supernatant was ultracentrifuged at 100,000×g to pellet the membranes. Protein concentration was determined by the method of Lowry and injected into rabbits at 0.5 mg/ml for the generation of antiserum.

[0321] Immuno-reactive proteins were screened from approximately 4×10₅ PFU from an unamplified cDNA expression library. Fifteen 150 mm LB agar petri dishes were plated with approximately 3×10⁴ PFU and incubated at 42° C. until plaques formed. Nitrocellulose filters (Schleicher and Schuell, Keene, N. H.), pre-wet with 10 mM IPTG, were placed on the plates and then incubated at 37° C. over night. Filters were then removed and washed 3× with PBS, 0.1% Tween 20, blocked with 1.0% BSA (Sigma, St. Louis, Mo.) in PBS, 0.1% Tween 20, and finally washed 3× with PBS, 0.1% Tween 20.

[0322] Blocked filters were then incubated overnight at 4° C. with rabbit antiserum that was developed against a total membrane preparation of cell line, diluted 1:200 in PBS, 0.1% Tween-20 and preadsorbed with E. coli and other proteins to remove superfluous and irrelevant antibodies. The filters were then washed 3× with PBS-Tween 20 and incubated with a goat-anti-rabbit IgG (H and L) secondary antibody (diluted 1:1 000 with PBS-Tween 20) conjugated with alkaline phosphatase (Rockland Laboratories, Gilbertsville, Pa.) for 1 hr. There filters were then washed 3× with PBS, Tween 20 and 2× with alkaline phosphatase buffer (pH 9.5) and finally developed with NBT/BCIP (Gibco BRL, Rockville, Md.).

[0323] Reactive plaques were excised from the LB agarose plates and a second or third plaque purification was performed following the same protocol. Excision of phagemid followed the Stratagene Lambda ZAP Express protocol, and resulting plasmid DNA was sequenced with an automated sequencer (Applied Biosystems, Foster City, Calif.) using M13 forward, reverse and internal DNA sequencing primers. Nucleic acid homology searches were performed against the GenBank nucleic acid database. Those showing some degree of similarity with sequences in the databases are described in Table 3. cDNAs that showed no significant similarity to known sequences in the database are listed in Table 4. TABLE 3 GENBANK SEARCH RESULTS FOR cDNA MOLECULES ENCODING HUMAN OVARIAN TUMOR PROTEINS SEQ ID NO SEQ ID Clone of GenBank NO: Identifier GenBank Search Results sequence 173 65346 Hu.26S proteasome subunit5a non-ATPase,4; antisecretory 264 factor-1 174 65347 Hu.cleavage stimulation factor 77kD subunit 265 176 65349 94% Dog thyroid mRNA for C3VS 266 177 65351 Hu. transcriptional coactivator ALY 267 178 65352 Human (plasma) protein S 268 179 and 65353 Hu. keratin 18 269 180 181 65354 Hu.complement subcomponent binding pro. (C1QBP); P32 270 subunit of human pre-mRNA splicing factor SF2 182 65356 84% Mu.partial Rabip4 FYVE-finger containing protein 271 183 65357 Hu. glucocorticoid-induced leucine zipper 272 184 67313 Hu. major vault protein;sim. to kinesin-like 4 273 185 67705 Hu.chromosme-assoc.protein-E (CAP-E) 274 186 65360 portion of Hu.clone RP11-430K20 187 65362 Hu.proteasome 26S subunit p27,non-ATPase, 9 (PSMD9) 275 189 and 65364 Hu.cDNA FLJ10188 fis, clone HEMBA1004693 276 190 191 66251 Hu.eukaryotic translation initiation factor 4 gamma 277 192 67315 Hu.ubiquitin specific protease 16, FLJ21451 fis 278 193 67657 Hu.lysosm1.membr.glycoprotein CD63 (melanoma Ag.1) 279 194 65369 MUC1 280 195 67316 Hu.Ste20-related serine/threonine kinase 281 196 65371 Hu. FLJ21480 fis, clone COL05034 282 197 65373 galectin-3 283 198 65376 Hu. PIST 284 199 65377 Hu.hypothetical protein FLJ10955 285 200 and 65379 Hu. clone 24627 mRNA sequence 286 201 202 66253 Hu.alpha 1,2-mannosidase 287 205 and 66256 Hu.cDNA FLJ10824 fis,FLJ 10683 fis;KIAA0592 protein 288 206 207 66257 Hu.poly(A)-binding pro., cytoplasmic 4 (inducible form) 289 208 67708 Hu.RNA-binding protein BRUNOL2 290 209 67710 Hu. kinesin 2; kinesin light-chain protein 291 210 67319 Hu.centromere protein F, mitosin,CENP-F kinetochore 292 211 67711 Hu.Rho-assoc.coiled-coil protein kinase2(ROCK2) 293 212 66266 Hu.eukaryotic translation initiation factor 4A, isoform 1 294 213 66270 Human retinal pigment epithelium 295 214 66271 Hu.peroxisome receptor 1 296 215 66273 Hu.laminin, gamma 1 (formerly LAMB2) 297 216 66277 Hu.TATA element modulatory factor 1 (TMF1) 298 217 67712 Hu.nuclear protein, ataxia-telangiectasia locus (NPAT) 299 218 66285 Hu.ADP-ribosyltransferase 300 219 66286 Hu.putative secreted protein ZSIG13, serine protease 301 220 66287 89% murine mSin3A (clone pVZmSin3A) 302 221 and 66292 Hu.FLJ22272 fis;HepG2 3′ region MboI clone hmd2g02m3 303 222 223 66294 Hu.MUK-binding inhibitory protein (MBIP) 304 224 66295 Hu. RAD50-2 protein (RAD50) 305 225 67400 Human mRNA for KIAA0150 gene 306 226 67402 Hu.mitchndr.tRNA;cDNA:FLJ22981 fis,cloneKAT11391 307 227 67403 Homo sapiens ribosomal protein L8 308 228 67405 Hu. transmembrane protein 4 (TMEM4) 309 229 67408 Hu.annexin A2, lipocortin II 310 230 67409 Hu.nuclear corepressor KAP-1,KRAB-associated 311 231 67410 Hu.BAC clone RP11-404P12 232 67412 Hu.ubiquitin fusion-degradation 1 protein 312 233 67715 Mu.zuotin related factor2,Hu.M-phase phosphoprotein 313 234 67417 Hu.cDNA FLJ11189 fis,FLJ12238 fis,FLJ12470 fis 314 235 67421 Hu.cDNA FLJ20425 fis, clone KAT02707 315 236 67422 Hu.cDNA FLJ21378 fis, cloneCOL03256;sorting nexin7 316 237 67322 Hu.collagenase stimulatory factor (EMMPRIN),basigin 317 238 67323 Hu.transforming acidic coiled-coil containing protein 3 318 239 67326 Hu.quinone oxidoreductase homolog (PIG3) 319 240 67331 HU.clone 24636; 91%Mu.uterine protein (LOC55978) 320 241 67717 Hu.annexin A1, lipocortin 321 242 67720 Hu.alpha 1,2-mannosidase 322 243 67722 Hu. mRMA for KIAA1398 protein 323 245 and 67731 Hu.cDNA FLJ11173 fis;sim.toestrogen-responsive B box 324 246 protein 247 68556 Hu.chromosome-associated protein-C (hCAP-C) 325 248 68572 Human DNA sequence from clone LA16-361A3 326 249 68563 Hu.laminin recptr.1(67kD, ribosomal pro.SA)(LAMR1) 327 250 68569 Hu.farnesyl diphosphate synthase;GAPDH region 328 251 69263 Hu.SH3-containing protein EEN and chromatin assemblyfactor- I p150 subunit 252 69268 Hu. villin 2 (ezrin) (VIL2) 329 253 69274 Hu.clone A9A2BR11 (CAC)n/(GTG)n repeat-containing 330 254 69275 Hu.clone 24793 ionotropic ATP receptor P2X5b 331 256 69280 Hu.aldo-keto reductase family 1, member B1 332 257 69283 Hu.talin (TLN) 333 258 69286 Human pancreatic mucin (tumor) 334 259 69689 Hu.macrophage galactose-specific lectin (hMAC-2) 260 69690 Human 28S ribosomal RNA gene 261 69696 Hu.trans-Golgi network protein(TGN51) 335 262 69703 Hu.KIAA0336 gene product 336 263 69705 Hu.mRNA for KIAA1398 protein 337

[0324] TABLE 4 cDNA MOLECULES ENCODING HUMAN OVARIAN TUMOR PROTEINS THAT SHOWED NO SIGNIFICANT SIMILARITY TO KNOWN SEQUENCES SEQ ID Clone NO: Identifier 175 65348 188 65363 203 and 66255 204 244 67723 255 69276

Example 6 Analisis Of Ovarian Tumor cDNA Expression Using Microarray Technology

[0325] In additional studies, sequences disclosed herein were found to be overexpresed in specific tumor tissues as determined by microarray analysis. Using this approach cDNA sequences were PCR amplified and their mRNA expression profiles in tumor and normal tissues were examined using cDNA microarray technology essentially as described (Schena, M. et al., 1995 Science 270:467-70). In brief, the clones were arrayed onto glass slides as multiple replicas, with each location corresponding to a unique cDNA clone (as many as 5500 clones can be arrayed on a single slide, or chip). Each chip was hybridized with a pair of cDNA probes that were fluorescence-labeled with Cy3 and Cy5, respectively. Typically, 1 μg of polyA⁺ RNA was used to generate each cDNA probe. After hybridization, the chips were scanned and the fluorescence intensity recorded for both Cy3 and Cy5 channels. There were multiple built-in quality control steps. First, the probe quality was generally monitored using a panel of ubiquitously expressed genes. Secondly, the control plate also included yeast DNA fragments of which complementary RNA was spiked into the probe synthesis for measuring the quality of the probe and the sensitivity of the analysis. Currently, the technology offers a sensitivity of about 1 in 100,000 copies of mRNA. Finally, the reproducibility of this technology can be ensured by including duplicated control cDNA elements at different locations.

[0326] cDNA clones disclosed herein and described in Table 5 were selected firstly on the basis of low expression in normal tissues, and secondly on some degree of overexpression in tumors. TABLE 5 cDNA MOLECULES ENCODING HUMAN OVARIAN TUMOR PROTEINS SHOWING AT LEAST SOME OVEREXPRESSION IN TUMORS SEQ ID NO of SEQ ID Clone GenBank NO: Identifier GemBank Search Results sequence 173 65346 Hu.26S proteasome subunit5a non-ATPase,4; antisecretory factor-1 264 179 and 65353 Hu. keratin 18 269 180 182 65356 84%Mu.partial Rabip4 FYVE-finger containing protein 271 187 65362 Hu.proteasome 26S subunit p27,non-ATPase, 9 (PSMD9) 275 188 65363 Novel 189 and 65364 Hu.cDNA FLJ10188 fis, clone HEMBA1004693 276 190 191 66251 Hu.eukaryotic translation initiation factor 4 gamma 277 192 65369 MUC1 280 196 65371 Hu.FLJ21480 fis, clone COL05034 282 200 and 65379 Hu.clone 24627 mRNA sequence 286 201 205 and 66256 Hu.cDNA FLJ10824 fis,FLJ10683 fis;KIAA0592 protein 288 206 207 66257 Hu.poly(A)-binding pro., cytoplasmic 4 (inducible form) 289 212 66266 Hu.eukaryotic translation initiation factor 4A, isoform 1 294 214 66271 Hu.peroxisome receptor 1 296 219 66286 Hu.putative secreted protein ZSIG13, serine protease 301 223 66294 Hu.MUK-binding inhibitory protein (MBIP) 304 224 66295 Hu.RAD50-2 protein (RAD50) 305

Example 7 Identification Of cDNAS Encoding Ovarian Tumor Proteins And Their Expression Profiles Using Microarrays

[0327] In this example, two additional clones derived from the library described in Example 5 were sequenced and their expression profiles determined.

[0328] Clone OVM-53 (clone identifier:66262) contains a 3493 base pair insert, the DNA and protein sequences of which are disclosed in SEQ ID NOs:338 and 340 respectively. Genbank analysis revealed that this clone had sequence homology with a human fatty-acid coenzyme A ligase, Genbank accession numbers AAZ33591 and AAT21760. Microarray analysis showed a tumor:normal tissue expression ratio of 1.14.

[0329] Clone OVM-65 (clone identifier 66269) contains a 1869 base pair insert, the DNA and protein sequences of which are disclosed in SEQ ID NOs:339 and 341 respectively. Genbank analysis revealed that this clone had sequence homology to the humans hypothetical protein, FLJ20651, Genbank accession number AAC26013. Microarray analysis showed a tumor:normal tissue expression ratio of 1.38.

Example 8 Analysis Of cDNA Expression Using Real-Time PCR

[0330] Real-time PCR (see Gibson et al., Genome Research 6:995-1001, 1996; Heid et al., Genome Research 6:986-994, 1996) is a technique that evaluates the level of PCR product accumulation during amplification. This technique permits quantitative evaluation of mRNA levels in multiple samples. Briefly, mRNA is extracted from tumor and normal tissue and cDNA is prepared using standard techniques. Real-time PCR is performed, for example, using a Perkin Elmer/Applied Biosystems (Foster City, Calif.) 7700 Prism instrument. Matching primers and fluorescent probes are designed for genes of interest using, for example, the primer express program provided by Perkin Elmer/Applied Biosystems (Foster City, Calif.). Optimal concentrations of primers and probes are initially determined by those of ordinary skill in the art, and control (e.g., P-actin) primers and probes are obtained commercially from, for example, Perkin Elmer/Applied Biosystems (Foster City, Calif.). To quantitate the amount of specific RNA in a sample, a standard curve is generated using a plasmid containing the gene of interest. Standard curves are generated using the Ct values determined in the real-time PCR, which are related to the initial cDNA concentration used in the assay. Standard dilutions ranging from 10-10⁶ copies of the gene of interest are generally sufficient. In addition, a standard curve is generated for the control sequence. This permits standardization of initial RNA content of a tissue sample to the amount of control for comparison purposes.

[0331] An alternative real-time PCR procedure can be carried out as follows: The first-strand cDNA to be used in the quantitative real-time PCR is synthesized from 20 μg of total RNA that is first treated with DNase I (e.g., Amplification Grade, Gibco BRL Life Technology, Gaithersburg, Md.), using Superscript Reverse Transcriptase (RT) (e.g., Gibco BRL Life Technology, Gaithersburg, Md.). Real-time PCR is performed, for example, with a GeneAmp™ 5700 sequence detection system (PE Biosystems, Foster City, Calif.). The 5700 system uses SYBR™ green, a fluorescent dye that only intercalates into double stranded DNA, and a set of gene-specific forward and reverse primers. The increase in fluorescence is monitored during the whole amplification process. The optimal concentration of primers is determined using a checkerboard approach and a pool of cDNAs from ovarian tumors are used in this process. The PCR reaction is performed in 25 μl volumes that include 2.5 μl of SYBR green buffer, 2 μl of cDNA template and 2.5 μl each of the forward and reverse primers for the gene of interest. The cDNAs used for RT reactions are diluted approximately 1:10 for each gene of interest and 1:100 for the β-actin control.

[0332] In order to quantitate the amount of specific cDNA (and hence initial mRNA) in the sample, a standard curve is generated for each run using the plasmid DNA containing the gene of interest. Standard curves are generated using the Ct values determined in the real-time PCR which are related to the initial cDNA concentration used in the assay. Standard dilution ranging from 20−2×10⁶ copies of the gene of interest are used for this purpose. In addition, a standard curve is generated for β-actin ranging from 200 fg-2000 fg. This enables standardization of the initial RNA content of a tissue sample to the amount of β-actin for comparison purposes. The mean copy number for each group of tissues tested is normalized to a constant amount of β-actin, allowing the evaluation of the over-expression levels seen with each of the genes.

Example 9 Peptide Priming Of T-helper Lines

[0333] Generation of CD4⁺ T helper lines, and identification of peptide epitopes derived from tumor-specific antigens that are capable of being recognized by CD4⁺ T cells in the context of HLA class II molecules, is carried out as follows:

[0334] Fifteen-mer peptides overlapping by 10 amino acids, derived from a tumor-specific antigen, are generated using standard procedures. Dendritic cells (DC) are derived from PBMC of a normal donor using GM-CSF and IL-4 by standard protocols. CD4⁺ T cells are generated from the same donor as the DC using MACS beads (Miltenyi Biotec, Auburn, Calif.) and negative selection. DC are pulsed overnight with pools of the 15-mer peptides, with each peptide at a final concentration of 0.25 μg/ml. Pulsed DC are washed and plated at 1×10⁴ cells/well of 96-well V-bottom plates and purified CD4⁺ T cells are added at 1×10⁵/well. Cultures are supplemented with 60 ng/ml IL-6 and 10 ng/ml IL-12 and incubated at 37° C. Cultures are restimulated as above on a weekly basis using DC generated and pulsed as above as antigen presenting cells, supplemented with 5 ng/ml IL-7 and 10 U/ml IL-2. Following four in vitro stimulation cycles, resulting CD4⁺ T cell lines (each line corresponding to one well) are tested for specific proliferation and cytokine production in response to the stimulating pools of peptide with an irrelevant pool of peptides used as a control.

Example 10 Generation Of Tumor-Specific CTL Lines Using In Vitro Whole-Gene Priming

[0335] Using in vitro whole-gene priming with tumor antigen-vaccinia infected DC (see, for example, Yee et al, The Journal of Immunology, 157(9):4079-86, 1996), human CTL lines are derived that specifically recognize autologous fibroblasts transduced with a specific tumor antigen, as determined by interferon-γ ELISPOT analysis. Specifically, dendritic cells (DC) are differentiated from monocyte cultures derived from PBMC of normal human donors by growing for five days in RPMI medium containing 10% human serum, 50 ng/ml human GM-CSF and 30 ng/ml human IL-4. Following culture, DC are infected overnight with tumor antigen-recombinant vaccinia virus at a multiplicity of infection (M.O.I) of five, and matured overnight by the addition of 3 μg/ml CD40 ligand. Virus is then inactivated by UV irradiation. CD8+ T cells are isolated using a magnetic bead system, and priming cultures are initiated using standard culture techniques. Cultures are restimulated every 7-10 days using autologous primary fibroblasts retrovirally transduced with previously identified tumor antigens. Following four stimulation cycles, CD8+ T cell lines are identified that specifically produce interferon-γ when stimulated with tumor antigen-transduced autologous fibroblasts. Using a panel of HLA-mismatched B-LCL lines transduced with a vector expressing a tumor antigen, and measuring interferon-γ production by the CTL lines in an ELISPOT assay, the HLA restriction of the CTL lines is determined.

Example 11 Generation And Characterization Of Anti-Tumor Antigen Monoclonal Antibodies

[0336] Mouse monoclonal antibodies are raised against E. coli derived tumor antigen proteins as follows: Mice are immunized with Complete Freund's Adjuvant (CFA) containing 50 μg recombinant tumor protein, followed by a subsequent intraperitoneal boost with Incomplete Freund's Adjuvant (IFA) containing 10lg recombinant protein. Three days prior to removal of the spleens, the mice are immunized intravenously with approximately 50 μg of soluble recombinant protein. The spleen of a mouse with a positive titer to the tumor antigen is removed, and a single-cell suspension made and used for fusion to SP2/O myeloma cells to generate B cell hybridomas. The supernatants from the hybrid clones are tested by ELISA for specificity to recombinant tumor protein, and epitope mapped using peptides that span the entire tumor protein sequence. The mAbs are also tested by flow cytometry for their ability to detect tumor protein on the surface of cells stably transfected with the cDNA encoding the tumor protein.

Example 12 Synthesis Of Polypeptides

[0337] Polypeptides may be synthesized on a Perkin Elmer/Applied Biosystems Division 430A peptide synthesizer using FMOC chemistry with HPTU (O-Benzotriazole-N,N,N′,N′-tetramethyluronium hexafluorophosphate) activation. A Gly-Cys-Gly sequence may be attached to the amino terminus of the peptide to provide a method of conjugation, binding to an immobilized surface, or labeling of the peptide. Cleavage of the peptides from the solid support may be carried out using the following cleavage mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After cleaving for 2 hours, the peptides may be precipitated in cold methyl-t-butyl-ether. The peptide pellets may then be dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized prior to purification by C18 reverse phase HPLC. A gradient of 0%-60% acetonitrile (containing 0.1% TFA) in water (containing 0.1% TFA) may be used to elute the peptides. Following lyophilization of the pure fractions, the peptides may be characterized using electrospray or other types of mass spectrometry and by amino acid analysis.

[0338] From the foregoing it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims.

1 341 1 208 DNA Homo sapiens misc_feature (1)...(208) n=A,T,C or G 1 gatcccccgg gctgcaggaa ttcggcacga gggtttcatt cttgacncta ataanttgan 60 gctttnaatt nctaantgag gcncttttta ctagcntcgc tgaaacanac tagaagacat 120 acataantgg natggatgtc ttgtgcttnt ggntggaaca gatngaatga tgttaanaca 180 ncaatactat ccaatatgat ntgcccac 208 2 903 DNA Homo sapiens misc_feature (1)...(903) n=A,T,C or G 2 ttgatatcga attcggcacg aggctggttt ttggcaagga tgagcctgga gggttgggtt 60 tcaaagaagg aggcgacatt ccattccatc agagcgtcac catgtggagg ccacggcctg 120 actgcagata ggaaagaaaa cgctctaaag ccccagatat gactacattt gttctcaacc 180 tgaacttcgc ctcaaggtaa tcttgggagg taaatttgat ttctgagaat cgcttcacct 240 gttggagaga aagcacgcca tgcacatcag cctccccttc ctggtgagaa gagggaagct 300 ggagcctctc cagagggaca ggaaatccct ataaatggcc aatggccgct gacatccaag 360 caggccaatt aaaaggaaaa tacatcagtt tacagccaag gcaaccccac accatttcat 420 aacacggggc taggcacagc ttgtgccttg gtggtccagt cctgtgccca cgtcagagct 480 aagcacttgc tacgctcctt tttggcctga aatcccacag acctggttct gagttccttc 540 ctccccactc ctatgacctt ggtcacgtct ctgagcctca tgtcacaggt ttccacagga 600 agaatgtgca gatccttcct gaatcggccc catgagccaa atggattgaa gaacggntat 660 cattcagtca actacgcgtg ggaaccgccg ggatctcctg ggggatgggc agngtacgag 720 tacccaagca ttccatttgt tcactggngg ccaatcacgc ccctaataaa atgaagacct 780 ttgtgtgaat gggcttttga aaccanctta ccaataaaat ctggttattc cccatnattn 840 tttncttaan aanaaattta tttttgggac ttactnttaa cccctttcct aaaggtctca 900 caa 903 3 904 DNA Homo sapiens misc_feature (1)...(904) n=A,T,C or G 3 naagctggnc tcccgcggtg gcggccgctc tagaactagt ggatcccccg ggctgcaggt 60 caaatttaaa taagtttgct tttaaaaaac aaagtacata ataatataga ggcaaattaa 120 gtaacacatt aaaatgaact aaagttctat tactgcattt tactgtataa ttaggaatga 180 gatacaacag aacaatcttt ggaaaaatct catagcagaa tcattttaaa agttaaacag 240 tctcatacta attcaaagag gtttgtaggg catattttag tctcattcta attccctaaa 300 ttggatttat atatccagag aataggcagc catatttggc aatggagagg gaatctaata 360 ccataaatca tgtaatatta tacacagaaa acaggctttt aaaaaataaa gtgaatttga 420 aatatttttc ataataaaag taactaaact aataaaatac tacattattt gtacaatacg 480 aaaagtcaat aggtacaaag gttgaatgtt tcaaaaagag aagtataacc aagcttaaca 540 tcatttctta ctcaaaagat acctcgataa acatactctg ttctcattat taccagtaat 600 ccccccttga tggattgctt ggagaaagaa tgcttaggcc ttaaactcca ctgccttggt 660 ttggcagtca gctgaagggg ccagttgagg aaaaaacccc agtgagttct agtcagactc 720 cattcaattg taaatggana agtatcacta accatcnccc tttcaccgct aatctganat 780 cgctagncaa aaatattcaa gaatttttgg gtaaagtttc tgtttaagac tcnggatttn 840 aaatccctat ttnttttggn gangtttnaa aaatnatnan ctttcctccn ccccccaact 900 tttt 904 4 835 DNA Homo sapiens misc_feature (1)...(835) n=A,T,C or G 4 ttggcaacgt taactacctg gtaccgggcc ccccctcgag tttttttttt tttttttttt 60 gganatggag tctccctcta tcgcccaggc tggagtgcag tagcacagtc tcggctgcca 120 ggcctcccca atctcttctt ctcacagggc agggctgaac agtatctcct cctggctgca 180 tccgagccca gaagtcaaaa cctaaggggt ccctgcttgg ctgggacaga ggccgcagac 240 ctaggtccag ctctgactcg acagtggcac atccctgtga agtagggctt gggggcttcc 300 catgaacagc tttcaaaagt gaagtgcant aaaaatgggg cttggaggga caagttttga 360 caagancttg gagcaagggg ctttgaattt ttaaccccaa cantccaccc ctttacccca 420 ttgaacccaa ggtgaacant ttcaaaagga ggggttcacc tggcccttgc aaaagtaaag 480 aagctgcaaa aatcaacaag cttnttgcac caaaaggcaa aaatggcang ggaaacaagg 540 ggaagggaaa tggtccaata ntttaatgcc ccaanctgca ncaagttngg cttgggaacc 600 caagncctgg gnctggctca tgactgggca atcatctttg gcatatttct ggtggctata 660 cttttattaa aaagaatggg aaaaacangg ggcngnaaac tanntatagt ttctaattnc 720 cagnaaaaaa atttgtncaa ttaacaantt gncctgganc ctaaaacnnc ccttgccttn 780 cccgggacct ttaanccctt tnccccccgg ggaaggnccc cattggaggn tccag 835 5 810 DNA Homo sapiens misc_feature (1)...(810) n=A,T,C or G 5 cntngcnatg ttccttacct ggaccgggcc ccccctcnag tcnatgggca tctttnnngg 60 caaccttttn tatttttatt ttttattttt ttttggttac gaaaacttgt caatatggca 120 atgtcattga atttttttct ttttcattcg ctgtatccat cagattcaat ctttaataca 180 gctcatattt gtctctttct atccagtctc atttgtacat tgttagacaa gtgtttatca 240 ctaatctgga atacatcatc ttcaataagg ctcttgtttt cctccaagct gcactgctca 300 cactgctcag ttttctggta agcaacctgc tcattatagt agagccccca ggggatctgg 360 tcttctgntc ttcagaaagt cctatttctt gtgcaacagg gctacntgan ttgnanaccc 420 ncaagtcata cgaataacat tttttccatt ccctccggat atcttcncat ncataanata 480 aacccatgna acncnaaatt ncctataant cacnccttga aagctanaan aaagacttcn 540 nccaggnaat ggnnaaaaaa cccttgggtt tggaaacnca atgggttttt aaggggaaat 600 tgccttggtt antttaaaaa cccccaaggn ggccctttna agagaattaa ccggnaaaaa 660 ggggtgggaa aggccngggc cnttttgccc aaatnttttt tttccaaagg gggtttgnat 720 tggaaaaaat nttttcngng gggggnntct tttaaacnan cccngcccng ggaanctttt 780 tttttttncc ccaaggaaca aaacctccaa 810 6 247 DNA Homo sapiens misc_feature (1)...(247) n=A,T,C or G 6 tntgtttttn acctggganc gggccnctct agcccgggcg gatcccccgg gctgcaggaa 60 ttcggcatga ggcggtgaag ggccgcgggg ccgatnncgg gaagtggcgt ccagtcngtt 120 tttggcnggc gagagccaga gcgtgagaac cctcccacgg cgaggccaag ccctgagact 180 ggactagcaa ggtcgtgagt ctctctgtca gctctccgca nccaaagggc gcggggggcg 240 gggaccc 247 7 689 DNA Homo sapiens misc_feature (1)...(689) n=A,T,C or G 7 agggtctttt tcttctaatg actagatttg aataagtttc acaatagtaa aggatgttaa 60 caagaccttt taggtaaggc caattcaatg aactaattca atatactgtt gttgtagatt 120 ctacaacttg atatacagtt ggagtagatt tgttatatac ttgaatatgt gtgtcctata 180 agtgtctgag ataattcact cagttggaaa gagctgcata tagagaagtg ctaaaactgt 240 taggtctgca taaggttagt ggttttttat tttaggagac tctgtgattt atggcttgat 300 gttggagtct ctgggaaagg aaggaagttg tacccaaaaa agcctctgga aggtgctgta 360 gtcataatct ttctcacaga actaccaggg agtcctcttg aaaagcaaat ttctctgaac 420 tcatcaaaac ttaaaacctc acccttgaag actatgtgat tctcccacat gatgtcccca 480 ttttgaggtg ggncactcaa gactcaacaa caagntttgg aagtgcccca agttcctgac 540 ttgncaaaaa agaaatagtt natttaagaa cacagctgac atgccctccn cactccttgg 600 ctttanattt aaaaantggg atttatcttt ccaaataaac ttattccccc nctgctgcta 660 aagcacacng ccggctcntt gatgcttcc 689 8 843 DNA Homo sapiens misc_feature (1)...(843) n=A,T,C or G 8 ctcttcgana ntcgggaatc tgaatntnta tgatgagagc taantntccc tgacctttgc 60 tttantgaga gacattactc tttataaact acacagtaaa caaaccttcc cttttattca 120 gaggaagaca caatctcttc caaggctgtt tgtcagaaaa caattagtgc cttttatggt 180 aatagatgca gaaatgtgag acatttggag aattgtctct caacaatgtt tgctgctagt 240 ttcgatacca ctcttggctt cttgtgattt tttccataat tatataactt ggttagcctc 300 accgatgaat ctgatcaatg gtggttggac aaaatatgtc cagtttgtct tccataatat 360 ttaattaaaa ttactatcaa caggactcta ataagcagaa tagggncaat gtgtattatt 420 gacctcaacc cacaggcagg ngccccggac ccagccaact tagcaaggtc ttattaaacc 480 atccggatgc acccttanaa atccaggtat ctttctcctt aagttnggat ccggnctttt 540 cccttggaat gnggcattta catagggncc accaaggata tattcagcat tgcnctggcc 600 cattcttaac ttccatggag ggaanggnaa ggacnattcc tttntttgtn accacccctg 660 ggccaggtca ccattantaa naagttgaag ggttggcctg gnaanttgca tccntttaat 720 cccctnccca cctnaacgna angaacccaa atttttgcaa ccancttttt cntaaattgg 780 tttnggacta atnaaaccat taaggggaaa ngggtnttcc nccataanng ggggccatta 840 aaa 843 9 714 DNA Homo sapiens misc_feature (1)...(714) n=A,T,C or G 9 cggatccccc gggctgcngg aattcggcac gaggtaaagg taaaggggat taaaactact 60 tttnttttgg gaagagagaa gatacaatac tggagannga ggaaattgat ttgttttacc 120 ctagaaatgt tttattaanc ccncgatgtg ttgtcagtat taatttgtaa tacgcaaata 180 attctcaact gcatttgaag aaaacttaaa acatcttgga tagggtaaat atatcaaaat 240 tctacatttg tagcctgaca gagcctcatt tcactcacgg ggcactagtc tttttcactt 300 acatagagca gttcaggatc agttgatgct gngatgttgt gtagtcagct gnggcatacg 360 nggaccgtag gactttattt gntaacctac tctgcattct aaatctgaaa ttgccagcaa 420 gcattttaat gccaagcatc ctggtagagn aaataacatt gcaaacagaa tctgnggaga 480 acacaaaatg ggcctggcta ccncccctta tttcatgccc caaanagtna agatttggat 540 ggggcancac ctttatatta ccttaaacaa gncattnggg gggntcntat gccgganaat 600 tggacccccc cggggggggg gggnnttant ttttcanccc caaaaaaggg gancntnacc 660 aaaatttncc cggggncccc ggggggggtt canttttttt nanggngggg gggg 714 10 538 DNA Homo sapiens misc_feature (1)...(538) n=A,T,C or G 10 ttttccctgg ntggtttttt ttttncnctc tcttaaaaag gaaacctcag ggcctttttn 60 atttgtaana nncattttag gatgaaaaat aanngcncaa ccncganagg gcaaatctca 120 gcataanaaa ccccaaatga cttgntttaa gctaatataa aagtagatgc tcaatacaaa 180 tncttaactc tttctgggca tgaaataana ggctgtancc aggaacattt tgngttctac 240 ncanattctg tttgcaatgn tatttactct aacaggangc tnggcattaa aatgcttgct 300 ggcaatttca natttaaaat gcanagnagg ntaacaaata aagtcctacg ggccncgtnt 360 gccacagctg actacacaac atcncagcat caactgatcc tgaactgntc tatgtaaggg 420 aaaaagacta gggccccgtg agtgaaatga ggntctgtca ggctncaaat gnanaatttn 480 gatatattta ccctatccaa nangttttaa ggttttcntc aaaagccngg nggaaaaa 538 11 254 DNA Homo sapiens misc_feature (1)...(254) n=A,T,C or G 11 acccacccaa atttaaccca gtaggggaaa acaaaaaaag ggcaaattat ntgggaggga 60 atggaccgcc nttcaaacac aaacnttctt tcccgcaaaa acaagncaag nctattcaaa 120 ggaaggtnta atnttttnct tgtacatggg tcaaagnttt tcaaaaccat ntcaagngnt 180 ctggtccccc aaaccagggt caccaacnac gccggggggg tggggnacnt ccctcgtgta 240 aaccttctcg gcca 254 12 227 DNA Homo sapiens misc_feature (1)...(227) n=A,T,C or G 12 ccatcaccaa cctgcagtat agtgttttct aaatgcagac tatagtggca cctaaatgna 60 nactatagtg tcacctaaat gcagactata gtgtcaccta aatgcanact gtagtgtcac 120 ctaaatgcag actatagtgt cacctaaatg cagactatag tgtcacctaa atgcagacta 180 tagtgtcacc taaatgcaga ctatagtgtc acctaaatgc agaagcc 227 13 760 DNA Homo sapiens misc_feature (1)...(760) n=A,T,C or G 13 catattgccc attgtattac agcggaagaa actgaggtat ggacaggtaa catgtccatg 60 gtcacttggc tggtgagggg cagagaggag atttgaaacc aaatctgact cactagtgtg 120 gccgtaacca tggtaactat gtctctctac catgtggtct cctctttatt aaaggaaggg 180 caagttctgg gagttttggg agttttgggc ttgagtgggg aagggtagcc aagtaaagca 240 ggtgagagaa ggtctgcttt aaggactgct gtttgatttt tattgttgtt gttcagtgtt 300 caatgggatt gagttgactc ttttttccct tcttgttccc caaagcatga gactgttccg 360 gtccttttcc cttttaactt ctcagctaga gtttgttagg gcgggtatgg gcacctggca 420 gagtctgaga cctcagcttc cagtaggcac acgttctgac ccaatacacc taccctggtc 480 ccctaacctg cttctggtcc cctaacctgc ttctgggccc aggtaatgca ttttaggaac 540 atcccacttt tctccttacc tggctttcca ttatccgtcc aaactaaagc acccacctgt 600 ctgcttcaga ctcttgcttc aagcactccg tctgggtcct canaaattga cttacagtca 660 gttcanatct gactcaggcg tggccttttt tctccttctt gcagcaccac agtcccattc 720 tggngccnnt tcacccttaa cntttnccca ttnancccaa 760 14 632 DNA Homo sapiens misc_feature (1)...(632) n=A,T,C or G 14 gctcgggctg ngagctcagg caggtgggac caggggctga agcccatggg gaggtgggag 60 gagcgatgcc gtgctccatc cggaccccct cagaggctcc tgggctgctg gggcacagcg 120 ggacatgctc ctgagtcccg cagacctggt tcaagtccag tgtctggttt ttattagcct 180 tctgtctgcg ggaatatctt gcctctgttt ctctccctct cttcttctcc ttccttctct 240 cttctctcac cttcatgcag tgacatataa aggtcacgag gacagaccct cctgcagcca 300 gattgctggg ttcatggttc aaatcccggg ggntctgcca cctcctggct ctatgcctga 360 cggtgactca cccaaacctc ctgtgtccca aattcctcat gtgaaacaga ggcaatagaa 420 gagctgtcct ggtagaatcc gtttatggca nacttgagtt caggtacaca cggcgctgac 480 atcagngctg attagaaaac cccaaaggag ggatgctcct attaatactg aggaaggatn 540 ttgnnctcac angggactgt gncangcact ggancctccg gatttggatg caccaagaat 600 gaggagaaat ggccctcccg ncttgtgaan tt 632 15 449 DNA Homo sapiens misc_feature (1)...(449) n=A,T,C or G 15 ggaggacacg gggtcataag ccattcgcgg ccccttcccc acctgggttt ctatccccag 60 agtacgtcct tggacctaga cccggtgact gcctgtgagg ctcgggctgt gagctcaggc 120 aggtgggacc aggggctgaa gccacatggg gaggtgggag gagcgatgcc gtgctccatc 180 cggaccccct cagaggctcc tgggctgctg gggcacagcg ggacatgctc ctgagtcccg 240 cagacctggt tcaagtccag tgtctggttt ttattagcct tctgtctgcg ggaatatctt 300 gcctctgntt ctctccctct cttcttctcc ttccttctct cttctctcac cttcatgcaa 360 gtgacatata aaggtcacga ggacagaccc tcctgcagcc agattgctgg gctcatggtt 420 caaatnccgg gggttctnnc acctgctgg 449 16 537 DNA Homo sapiens 16 gaactcatct tccccgggct cgagtgcggc cgcaagcttt tttttttttt ttggtaacca 60 ttctgaagct gttttgaact gagagaaaat gaatgctatt atcaattggt gatgtctgcc 120 atgaatgctg caatcaatta atgatgtctg tcatggatga gggagtggag cagatgccat 180 atagttttcc agtgtaaact ctggcagaca ctaaagttga accaaggcag aagaggaaag 240 tttccataca ttataagaaa gcatacttca ttccttcctt ccctgggaaa atccatattt 300 tcctgtagat ctaaatctgt cgatctacca tgtacaaggt cctgaagcag atactaggga 360 agcacaaggg agtaagacac agttgctcca atcagcaggt ttatgcacta ttggggtgac 420 tggatataaa cacaatgttc aaagcagaga aactgtgttt gcaattgact taaaaaatat 480 gcaataggac ttaaagtgag tgacaaaata aaaggttccc atgattcttc tgcctat 537 17 584 DNA Homo sapiens misc_feature (1)...(584) n=A,T,C or G 17 gatgatgatg gtgatgttgn taatgatggt ggtgattatg acaataatga tgatgatggt 60 gacagggatg gtgatgatnn tgatggtggt ggtgataaca aagttaatgg ataatatatg 120 aacttattgg ctactgaata tgcaccaaag tgctatgctc agtgtttaac tagtactatt 180 taatatgatt tctaaaaaaa atcttgaatt attataggca gaagaatcat gggaaccttt 240 tattttgtca ctcactttaa gtcctattgc atatttttta agtcaattgc aaacacagtt 300 tctctgcttt gaacattgtg tttatatcca gtcaccccaa tagtgcataa acctgctgat 360 tggancaact gtgtcttact cccttgtgct tccctagtat ctgcttcagg accttgtaca 420 tggtanatcg acanatttan atctacagga aaatntggat tttcccaggg aaggaangaa 480 tgaacatgct ttcnttatna tntatggaaa ctttcctctt ctgccttggt tcaaccttta 540 antgnctgcc nnaaattanc acctnganaa acntcttatg ggcc 584 18 622 DNA Homo sapiens misc_feature (1)...(622) n=A,T,C or G 18 ctcatcttcc ccgggctcga gtgcggccgc aagctttttt tttttttttg gtaaccattc 60 tgaagctgtt ttgaactgtt agaaaatgaa tgctattatc aattggtgan ggctgccatg 120 aatgctgcaa tcaattaatg atgtctgtca tggatgaggg agtggagcag atgccatata 180 gttttccagt gtaaactctg gcagacacta aagttgaacc aaggcagaag aggaaagttt 240 ccatacatta taagaaagca tacttcattc cttccttccc tgggaaaatc catattttcc 300 tgtagatcta aatctgtcga tctaccatgt acaaggtcct gaagcagata ctagggaagc 360 acaagggagt aagacacagt tgctccaatc agcaggttta tgcactattg gggtgactgg 420 atataaacac aatgttcaaa gcagagaaac tgtgtttgca attgacttaa aaaatatgca 480 atacngactt aaagtgagtg acanaataaa aggntcccat gattcttctg cctataataa 540 ttcaagattt tttntagaaa tcatattaaa tagnactagt taaacactga gcatagcact 600 ttggtgcata ttccagcncc ca 622 19 501 DNA Homo sapiens misc_feature (1)...(501) n=A,T,C or G 19 cctttantcc tattgcatat tttttaagtc aattgcaaac acagtttctc tgctttgaac 60 attgtgttta tatccagtca ccccaatagt gcataaacct gctgattgga gcaactgtgt 120 cttactccct tgtgcttccc tagtatctgc ttcaggacct tgtacatggt agatcgacag 180 atttagatct acaggaaaat atggattttc ccagggaagg aaggaatgaa gtatgctttc 240 ttataatgta tggaaacttt cctcttctgc cttggttcaa ctttagtgtc tgccagagtt 300 tacactggaa aactatatgg catctgctcc actccctcat ccatgacaga catcattaat 360 tgattgcagc attcatggca gacatcacca attgataata gcattcattt tctctcagtt 420 caaaacagct tcagaatggt taccaaaaaa aaaaaaaaag cttgcggccg cactcgagcc 480 cgggtgaatg attgagttta a 501 20 361 DNA Homo sapiens misc_feature (1)...(361) n=A,T,C or G 20 cntattctgc ttgaggacct tgtacatggt agatcgacag atttagatct acaggaaaat 60 atggattttc ccagggaagg aaggaatgaa gtatgctttc ttataatgta tggaaacttt 120 cctcttctgc cttggttcaa ctttagtgtc tgccagagtt tacactggaa aactatatgg 180 catctgctcc actccctcat ccatgacaga catcattaat tgattgcagc attcatggca 240 gacatcacca attgataata gcattcattt tctctcagtt caaaacagct tcagaatggt 300 taccaaaaaa aaaaaaaaag cttgcggccg cactcgagcc cgggtgaatg attgagttta 360 a 361 21 392 DNA Homo sapiens 21 cgctgatgga gcaactgtgt cttactccct tgtgcttccc tagtatctgc ttcaggacct 60 tgtacatggt agatcgacag atttagatct acaggaaaat atggattttc ccagggaagg 120 aaggaatgaa gtatgctttc ttataatgta tggaaacttt cctcttctgc cttggttcaa 180 ctttagtgtc tgccagagtt tacactggaa aactatatgg catctgctcc actccctcat 240 ccatgacaga catcattaat tgattgcagc attcatggca gacatcacca attgataata 300 gcattcattt tctctcagtt caaaacagct tcagaatggt taccaaaaaa aaaaaaaaag 360 cttgcggccg cactcgagcc cgggtgaatg at 392 22 374 DNA Homo sapiens misc_feature (1)...(374) n=A,T,C or G 22 ttnnngttgc tccaatcagc aggtttatgc actattgggg tgactggata taaacacaat 60 gttcaaagca gagaaactgt gtttgcaatt gacttaaaaa atatgcaata ggacttaaag 120 tgagtgacaa aataaaaggt tcccatgatt cttctgccta taataattca agattttttt 180 tagaaatcat attaaatagt actagttaaa cactgagcat agcactttgg tgcatattca 240 gtagccaata agttcatata ttatccatta actttgttat caccaccacc atcataatca 300 tcaccatccc tgtcaccatc atcatcatta ttgtcataat caccaccatc attaccaaca 360 tcaccatcat catc 374 23 300 DNA Homo sapiens 23 aactgtgttt gcaattgact taaaaaatat gcaataggac ttaaagtgag tgacaaaata 60 aaaggttccc atgattcttc tgcctataat aattcaagat tttttttaga aatcatatta 120 aatagtacta gttaaacact gagcatagca ctttggtgca tattcagtag ccaataagtt 180 catatattat ccattaactt tgttatcacc accaccatca taatcatcac catccctgtc 240 accatcatca tcattattgt cataatcacc accatcatta ccaacatcac catcatcatc 300 24 828 DNA Homo sapiens misc_feature (1)...(828) n=A,T,C or G 24 tgttaactna tcttcacccg ggctcgagtg cggccgcaag cttttttttt ttttttggta 60 accattctga agctgttttg aactgagaga aaatgaatgc tattatcaat tggtgatgtc 120 tgccatgaat gctgcaatca attaatgatg tctgtcatgg atgagggagt ggagcagatg 180 ccatatagtt ttccagtgta aactctggca gacactaaag ttgaaccaag gcagaagagg 240 aaagtttcca tacattataa gaaagcatac ttcattcctt ccttccctgg gaaaatccat 300 attttcctgt agatctaaat ctgtcgatct accatgtaca aggtcctgaa gcagatacta 360 gggaagcaca agggagtaag acacagttgc tccaatcagc aggtttatgc actattgggg 420 tgactggata taaacacaat gttcaaagca gagaaactgt gtttgcaatt gacttaaaaa 480 atatgcaata ggacttaaag tgagtgacaa aataaaaggt tcccatgatt cttctgccta 540 taataattca agattttttt tagaaatcat attaaatagt actagttaaa cactgagcat 600 agcactttgg tgcatattca gtagccaata agttcatata ttatccatta actttgttat 660 caccaccacc atcataatca tcaccatccc tgtcacccat ccattcattc atttatttgg 720 nccattaaaa tccaccccac cccattcatt taccccaaac catttcaccc tnatcatcgc 780 ttgaattcgg atccgatttc ncatggnctt gtcgncncgt cgggaccc 828 25 712 DNA Homo sapiens misc_feature (1)...(712) n=A,T,C or G 25 atccgaantc aancgatgat gatggtgatg ttggtaatga tggtggtgat tatgacaata 60 atgatgatga tggtgacagg gatggtgatg attatgatgg tggtggtgat aacaaagtta 120 atggataata tatgaactta ttggctactg aatatgcacc aaagtgctat gctcagtgtt 180 taactagtac tatttaatat gatttctaaa aaaaatcttg aattattata ggcagaagaa 240 tcatgggaac cttttatttt gtcactcact ttaagtccta ttgcatattt tttaagtcaa 300 ttgcaaacac agtttctctg ctttgaacat tgtgtttata tccagtcacc ccaatagtgc 360 ataaacctgc tgattggagc aactgtgtct tactcccttg tgcttcccta gtatctgctt 420 caggaccttg tacatggtag atcgacagat ttagatctac aggaaaatat ggattttccc 480 agggaaggaa ggaatgaagt atgctttctt ataatgtatg gaaactttcc tcttctgcct 540 tggttcaact ttagtgtctg ccnnagttta cactggaaaa ctatatggga tctgctccac 600 tcccctcatc catggaccag acatcattta aattggattg ccagcccttt ccannggggc 660 caaggaaccc nttcccaccc ccccnaattt ttgggggaat ttnaaaaatt tt 712 26 826 DNA Homo sapiens misc_feature (1)...(826) n=A,T,C or G 26 ttnttccccg ggctcgagtg cggccgcaag cttttttttt ttttttggta accattctga 60 agctgttttg aactgataga aaatgaatgc tattatcaat tggtgatgtc tgccatgaat 120 gctgcaatca attaatgatg tctgtcatgg atgagggagt ggagcagatg ccatatagtt 180 ttccagtgta aactctggca gacactaaag ttgaaccaag gcagaagagg aaagtttcca 240 tacattataa gaaagcatac ttcattcctt ccttccctgg gaaaatccat attttcctgt 300 agatctaaat ctgtcgatct accatgtaca aggtcctgaa gcagatacta gggaagcaca 360 agggagtaag acacagttgc tccaatcagc aggtttatgc actattgggg tgactggata 420 taaacacaat gttcaaagca gagaaactgt gtttgcaatt gacttaaaaa atatgcaata 480 ggacttaaag tgagtgacaa aataaaaggt tcccatgatt cttctgccta taataattca 540 agattttttt tagaaatcat attaaatagt actagttaaa cactgagcat agcactttgg 600 tgcatattca gtagccaata agttcatata ttatccatta actttgttat caccaccacc 660 atcntaatca tcaccatccc tgtcaccatc atcatcatta ttgncataat caccaccatc 720 attaccaaca tcccatcatc atcattaccc atcatcaccc ccaccatcac cattggcatc 780 ntggggcnta ntccccctca tcatcntnca tcnttggccc cccatc 826 27 832 DNA Homo sapiens misc_feature (1)...(832) n=A,T,C or G 27 ttnacagttg tccaatcagc aggtttatgc actattgggg tgactggata taaacacaat 60 gttcaaagca gagaaactgt gtttgcaatt gacttaaaaa atatgcaata ggacttaaag 120 tgagtgacaa aataaaaggt tcccatgatt cttctgccta taataattca agattttttt 180 tagaaatcat attaaatagt actagttaaa cactgagcat agcactttgg tgcatattca 240 gtagccaata agttcatata ttatccatta actttgttat caccaccacc atcataatca 300 tcaccatccc tgtcaccatc atcatcatta ttgtcataat caccaccatc attaccaaca 360 tcaccatcat catcattacc atcatcacca ccaccatcac cattgtcatc atggtcataa 420 tcaccatcat catcatcatc attgccacca tcagtatcat cataaaaata gctctctgag 480 aggagagatt attaattgtt tcatgatcca tgccatatcc ccaggatttg gcagagcccc 540 agggcttgtc acaatgctca ataaagtttg ttggttggct catcatcact agatttataa 600 agaatttcca tctttatttg actgaataat ggaagagatg tttacactca tcagccatgc 660 tctaacagag gtgcccattg gctgggtacc ttttatttca aagggggacc cagttaggaa 720 aagggggggg tgggggattt ccccccnaga gtttccccca nggtccccct gtggggtang 780 tccccnaang aaggccttcc ccngggagga aaaaaccccc ctttcttttt cc 832 28 818 DNA Homo sapiens misc_feature (1)...(818) n=A,T,C or G 28 tngtttccta cattataaga aagcatactt cattccttcc ttccctggga aaatccatat 60 tttcctgtag atctaaatct gtcgatctac catgtacaag gtcctgaagc agatactagg 120 gaagcacaag ggagtaagac acagttgctc caatcagcag gtttatgcac tattggggtg 180 actggatata aacacaatgt tcaaagcaga gaaactgtgt ttgcaattga cttaaaaaat 240 atgcaatagg acttaaagtg agtgacaaaa taaaaggttc ccatgattct tctgcctata 300 ataattcaag atttttttta gaaatcatat taaatagtac tagttaaaca ctgagcatag 360 cactttggtg catattcagt agccaataag ttcatatatt atccattaac tttgttatca 420 ccaccaccat cataatcatc accatccctg tcaccatcat catcattatt gtcataatca 480 ccaccatcat taccaacatc accatcatca tcattaccat catcaccacc accatcacca 540 ttgtcatcat ggtcataatc accatcatca tcatcatcat tgccaccatc agtatcatca 600 taaaaatagc tctctgagag gagagattat taattgtttc atgatccatg ccatatcccc 660 aggatttggg caganccccc agggcttggt ccancaaatt gcctccaaaa tnaaaaagtt 720 ttttgntttg ggggtttggg ggccttccaa ttcatcacta gattttataa aagaattttc 780 ccatctttta ttttgactgg aataatggaa gaagatga 818 29 557 DNA Homo sapiens misc_feature (1)...(557) n=A,T,C or G 29 aaggggctgg gagggcacaa gntattccct ttcccttctg gccagctcca gagagagacc 60 cagctcaggc ccgatatgca gcaaggcctg taaatagttt tatttgctga cctttctgcc 120 atgagaggct tggatgcttc ccctgaagag ggtttctctg tagctcttgg gactaccaca 180 gtggacctgg gaaactctgg ggatccaccc cttctactgg tcccttgaat aagtaccagc 240 caatggcacc tctgttagag catggctgat gagtgtaaac atctcttcca ttattcagtc 300 aaataaagat ggaaattctt tataaatcta gtgatgatga gccaaccaac aaactttatt 360 gagcattgtg acaagccctg gggctctgcc aaatcctggg gatatggcat ggatcatgaa 420 acaattaata atctctcctc tcagagagct atttttatga tgatactgat ggtggcaatg 480 atgatgatga tgatggtgat tatgaccatg atgacaatgg tgatggtggt ggtgatgang 540 gnaatgatga tggacgg 557 30 580 DNA Homo sapiens misc_feature (1)...(580) n=A,T,C or G 30 cttctggcca gctccagaga gagacccagc tcaggcccga tatgcagcaa ggcctgtaaa 60 tacgttttat ttgctgacct ttctgccatg agaggcttgg atgcttcccc tgaagagggt 120 ttctctgtag ctcttgggac taccacagtg gacctgggaa actctgggga tccacccctt 180 ctactggtcc cttgaataag taccagccaa tggcacctct gttagagcat ggctgatgag 240 tgtaaacatc tcttccatta ttcagtcaaa taaagatgga aattctttat aaatctagtg 300 atgatgagcc aaccaacaaa ctttattgag cattgtgaca agccctgggg ctctgccaaa 360 tcctggggat atggcatgga tcatgaaaca attaataatc tctcctctca gagagctatt 420 tttatgatga tactgatggt ggcaatgatg atgatgatga tggtgattat gaccatgatg 480 acaatggtga tggtggtggt gatgatggta atgatgatga tggtgatgtt ggtaatgatg 540 gnggngatta tgacaatnat gatgatgatg gngacnggga 580 31 277 DNA Homo sapiens misc_feature (1)...(277) n=A,T,C or G 31 cctttgcggg ttaactcatc attcaccggg ctcaatgcgg ccgcaagctt tttttttttt 60 ggttngggcc ttcgaactct tntctcccta ncatatcctg ggtggagatn cacgncccat 120 ttctctatga nacctgtgca cacacaangg nccgcanntt tctgtgcaaa ncgctcanaa 180 cgcangtgac tttccataat gggaacnctc atgggtgctg ggggaanacc cgagannant 240 ttaaccnatg gnccaccacc ttctanaaga ggaaaaa 277 32 311 DNA Homo sapiens misc_feature (1)...(311) n=A,T,C or G 32 agcacttggt cttggctcca gtagcctctt gaggtggaga gtggtgggga gccctaccnt 60 ] gtagccttca gtggatgctc tccccttggt cacctctgga acaaacaggt tccttccctg 120 gggtccatan ggagggacct catggggctg tcacatccta ncaaccttct tagcctcctt 180 ccttccttcc acacggccca naggccagct cggncccacc cagagggcct ggccagctgc 240 tcccaancca cccagggagg cagcagacag gtagggagct cctggggcac ggccgtggna 300 ttgctttcac c 311 33 243 DNA Homo sapiens misc_feature (1)...(243) n=A,T,C or G 33 cctttttttt tttntttcct gattactttt tattttattt agagtaagtg acatatttgt 60 ttccccactt acagatgana aatatctgaa gtggaagtgg agtgacgtgg ttgtaatgag 120 ggtanctcct catctcctga ttgaggtcac ttnctccaca caaacctcan nctgaggtgt 180 ngatntcaat naaaggattc tnttattacc tnnacgntcc tctgnctccg gtnnanccca 240 tcc 243 34 710 DNA Homo sapiens misc_feature (1)...(710) n=A,T,C or G 34 tttgactccg ttttgcatca agcttggtac cntnctcgga tccctagtaa cggccgccag 60 tgtgctggaa ttcgcggccg cgtcgacgtt tncttgaagt aatttcagga actttagggc 120 gctgaaaaac attcctctaa gtatttattt cttgagtttg ttattgtgca aattcatata 180 tttcatcata atattttggt tgaaattaga aagtagtagt aataggcaat ccaaaaaaat 240 ggatacagaa cataatgaag tcttaaactt tattcagagt aagaacactt tgtagaaata 300 agtacacttc atatttgtac atccctgcta gacagtttga agcccctgga aattatttta 360 catttatttg agttatttat aacatgtgaa ttggcttcga ggtgttgaaa atattctaga 420 cttgcncacc aagaattacc aataagaccc tgacctactt agggcttttt gtagtctttt 480 ctttagtgta gaagtcaaag aaacccgata cngatattgn ccttgatagg ctcatatgct 540 annattnggt taggatatga naaaagccct tgattttnta aaaaagccnt ggnatgagnc 600 ttcncttcat naantcctga ctntgatncn ttttcatccc cntaggaaaa agcctccntc 660 gacttcctga annaaaaagg ggggactnct gantgatgca gtaaanccca 710 35 693 DNA Homo sapiens misc_feature (1)...(693) n=A,T,C or G 35 ttnactncnt tttgcatcaa gcttggtacc ntnctcggat ccctngtaac ggccgccagt 60 \ gtgctggaat tcgcggccgc gtcgacggnt tnactaagca ggaaggctgt gtctcccttc 120 cccacactgt attccaacca gacagctccc ttctatttta tatgttgggc tagtatgaaa 180 aacaattctg ctaggaaaaa aaaatttttt tagaagtttg aaagctagtg atttgaccca 240 agttcctcat tttacagctg gggaaactaa ggcctagaga gaggcaagtg atatttgttt 300 catggcangg ctgggctgag atcctgcttt tctgggtctg tctttttgct cttttgggtc 360 tgatgcagtt ctcaccccac acagnccaaa tctgttgctt ctggcctcag ctgtgcccca 420 gcttactcag tgacctcttg caagtcctgc cctctctggt ctcagtttct tcatctgaac 480 caggggtacc ctggaacaag ctgctctctg caggatctgc caccttgaaa ccttgaccta 540 agaaaaaact acctgggggg ggggatctgg ggtttgctac aaaaaactta cngacccttg 600 gcaagggggg tanaaagnat ttnaacccca atttttacca acaccaggca ggggngggtc 660 aaaaacncaa cctttttccc aaaccaagaa att 693 36 341 DNA Homo sapiens misc_feature (1)...(341) n=A,T,C or G 36 aacgatttgn ggcagcccta tacantctng cattgaaggn ntggngtgta gactaganag 60 nccccntnag tctgtaccgn gnatgcacac anatganatg tatctgccca nngcgggctn 120 accgatgggt agatccccct gactcagagc acagtantgc tgtgccaata aaggantacc 180 nngatactgt catgngcgng ggcatattgn ggatnaaagc nngtncctga nnaccaccna 240 nttgggggcn gagtnggcnt angatanacn ggangaactg gcnnatgtgc ancncnccga 300 aaatgatntc cccatngacg ttaangggan gtcntgggng n 341 37 232 DNA Homo sapiens misc_feature (1)...(232) n=A,T,C or G 37 gcttcctttt ggncngnatg gggccctnta gangcatgct cgagcggccg ccagtgcgat 60 ggatatnttt tttaatcttc ttctcncccc nggggntttt tacattctct cccccnccnt 120 cctccctntc gtctctaccc cctcatcctc ctgnncntcc ccctactnng cctccntnct 180 ntctactntc cnctcnctca tttccctnnn ctctttccna cactcattat ct 232 38 227 DNA Homo sapiens misc_feature (1)...(227) n=A,T,C or G 38 tcagtggttt aaacgggccc tctatactcg agcgnccgcc cttttttttt ttttttatnt 60 ggtggaantc tgcanatatc cagcncagtg gcggccgcgt cgacgtgtga gcatggnatt 120 ttgtctcgga agaaaaancc ntgggtnngg cccccacaat accacccaca ttgcactcgg 180 ngatgaacaa gggctgactg attgatnggn tattncagag tntctat 227 39 413 DNA Homo sapien misc_feature (1)...(413) n = A,T,C or G 39 ggcacgaggc tgcgttgcgt ttgtttacag accacgcaag gagttcatcc caaaatgatc 60 agtaatctgc aagtgttcgc cataggccca cagtgctcca aggtggaagt ggtagcctcc 120 ctgaagaacg ggaaggaaat ttgtcttgat ccagaagccc cttttctaaa gaaagtcatc 180 cagaaaattt tggacggtgg aaacaaggaa aactgattaa gagaaatgag cacgcatgga 240 aaagtttccc agtcttcagc agagaagttt tctggaggtc tctgacccan ggaagacaag 300 aaggaaagat tttgttggtg ttggttattt gttttccagt agttagcttc ttcctgattc 360 ctcactttga agagtgtgag gaaaacctat gttgccgtta agctttcagc tca 413 40 413 DNA Homo sapien misc_feature (1)...(413) n = A,T,C or G 40 ggcacgaggc tgctgttctg gggtgtggtg tntcacagct tcccancgac tctngaaaca 60 caagancaag atgtggactt acnccagana tacctggaaa aatactacaa cctgaagaat 120 gatgggaggc aanttganaa gcgganaaat antgncccat tggttgaaaa attganncan 180 atgcaggaat tctttgggct gaaagtgact gggaaaccag atgctgaaac cctgaaggtg 240 atgaacagcc canatgtgga ntgnctgatg tggctcactn tgtcctcact gangggaacc 300 ctcgctggga gcaaacacat ctgacctaca tgattgaaaa ttacacgcca aatttgccaa 360 gancagatgt ggaccatgcc attganaaag ccttccaact ctggagnaat gtc 413 41 411 DNA Homo sapien misc_feature (1)...(411) n = A,T,C or G 41 ggcacgaggg aactatgcct gggcngggcg aagccagagg aaactctggt ggaggtccgt 60 ancggtcctg acgtgcaaat cggtcgtccg acctgggtat aggggcgaaa gactaatcga 120 accatctagt agctggttcc ctccgaagtt tccctcagga tagctggcgc tctcgcagac 180 ccgacgcacc cccgccacgc agttttatcc ggtaaagcga atgattagag gtcttggggc 240 cgaaacgatc tcacctattt tcaactttta aatggggtna aaaagcccgg ttcnttggcg 300 tggaanccgg gcgttgnaat gcaagtgcct antggggcca ctttttggna agnanaactt 360 ggcgcttgcg gggatgaacc cgaaccnccg ggtttaaggg cgcccnaatg c 411 42 411 DNA Homo sapien misc_feature (1)...(411) n = A,T,C or G 42 gcacgaggca aatctcctgc ctggtacaga atatgtagtg agtgtctcca gtgtctacga 60 acaacatgag agcacacctc ttagaggaag acagaaaaca ggtcttgatt ccccaactgg 120 cattgacttt tctgatatta ctgccaactc ttttactgtg cactggattg ctcctcgagc 180 caccatcact ggctacagga tccgccatca tcccgagcac ttcagtggga gacctcgaga 240 aagaatcngg gtgccccctc tcggaattcc ttcaccttca ccaacctnat tccaggcaca 300 gagtattgtg ggncagcatc gttgttctta atgggcaaag aggaaagtcc cttattgatt 360 gggccacaat caacagtttt gatgttccna agggacctgg aagttgttgc t 411 43 411 DNA Homo sapien 43 ggcacgaggc aggtcagaat ggcggcagcg gagcatcgtc attcttcagg attgccctac 60 tggccctacc tcacagctga aactttaaaa aacaggatgg gccgccagcc acctcctcca 120 actcaacaac attctataac tgataactcc ctgagcctca agacacctcc cgaatgtctt 180 cttattcccc ttccaccctc tcctcttcca ccctccgtgg atgataatct caagactcct 240 cccttagcta ctcaggaggc cgaggcagaa aaatcactga aacccaagag gcagaggttg 300 agtaagctga gaacaggtca ttgcactcaa gcctgggcaa taagaagaaa tctgtgagtg 360 gaacaaaaga aaaaaatcaa aaaacaaaac aaaacccaca cttccaaaac c 411 44 405 DNA Homo sapien misc_feature (1)...(405) n = A,T,C or G 44 ggtaagaatg gccttggaag ggatgagcaa acggaagaga aagagaagtg tccaggaggg 60 agagaatcct gacgacggcg ttcgcgggag tccgccggaa gactacangc ttggacaggt 120 cgccagtanc ttatttcgcg gcgaacacca ttccagaggt ggcaccggtc ggctggcgtc 180 cctcttcagt tctctggagc cccagattca acccgtgtac gtgcctgtgc ctaaacaaac 240 catcaaaaaa acgaaacgga atgaggagga aganagtaca tcccagattg aaagaccact 300 ttcncaagaa cctgccacaa aagtgaaagc gaanaanaan cncactaacg canaaaaaaa 360 \ gttggcanac agggaaagcg cttttancga gtgcttgntt tanaa 405 45 376 DNA Homo sapien misc_feature (1)...(376) n = A,T,C or G 45 gctgcaccac tggcatggat ggcggcatga gtatctggga tgtgaagagc ttggagtcag 60 ccttgaagga cctcaagatc aaatgacctg tgaggaatat gttgccttca tcctagctgc 120 tggggaagcg gggagagggg tcanggangc taatggttgc tttgctgaat gtttctgggg 180 taccaatacn angttcccat aggggctgct ccctcaaaaa gggaggggac agatggggag 240 cttttcttac ctattcaagg aatncgtgcc tttttnttaa ntgcttncat ttattgaaaa 300 aaaaaaaaan tgcccccaaa gcactatgct ggtcatgaac tgctncaaaa tgtggaggta 360 ataaaatgca ctgtgt 376 46 330 DNA Homo sapien misc_feature (1)...(330) n = A,T,C or G 46 ggtcangcct cangaccaca ngntcttatg atgggtcatt ttccctctca tccctgagag 60 aannacanta acctgtcatt cnctggactg aatctggttg cccggaatga gtgctcactt 120 gatggagacc cagcagaggg aggcanaagg gccttcccac tctgccagct tcctggagcc 180 gtgcatttcc tcccccttgg agacacaatg gctncatctg gaacaagagg tggngtggct 240 cctggctaag cttgntgtgc anagcccttg cccccagtca ctgctacact gccacngtaa 300 tgtgnaggtg gcacagcnct cacagtgatg 330 47 218 DNA Homo sapien 47 ggtcacggag gataagatca atgccctcat taaagcagcc ggtgtaaatg ttgagccttt 60 ttggcctggc ttgtttgcaa aggccctggc caacgtcaac attgggagcc tcatctgcaa 120 tgtaggggcc ggtggacctg ctccagcagc tggtgctgca ccagcaggag gtcctgcccc 180 ctccactgct gctgctccag ctgaggagaa gaaagtgg 218 48 406 DNA Homo sapien 48 gctgaaaagg gcatctcaga tttagctcag cactacctta tgcgggccaa tatcacagcc 60 atccgcagag tccggaagac agacaataat cgcattgcta gagcctgtgg ggcccggata 120 gtcagccgac cagaggaact gagagaagat gatgttggaa caggagcagg cctgttggaa 180 atcaagaaaa ttggagatga atactttact ttcatcactg actgcaaaga ccccaaggcc 240 tgcaccattc tcctccgggg ggctagcaaa gagattctct cggaagtaga acgcacctcc 300 aggatgccat gcaagtgtgt cgcaatgttc tcctggaccc tcactggtgc cagggggtgg 360 ggccttcgag atggctgtgg cccatgcctt gacagaaaaa tcaagg 406 49 680 DNA Homo sapiens misc_feature (1)...(680) n=A,T,C or G 49 aaccccctct ttacctgata ttttanttcg agactctagc tacatgccca cctacttaac 60 aggtactagt gacaggtaca aaaacattat gggtaacaat tctgagtgtt taatgcaagc 120 ccaggtgaag cagggtagct tccatcagca ggtacagacg ttacgctgaa aagaggtgca 180 ttctgcattg cactcctgga tctaagtttc tgtattctca gagcatcaat gcagcaagct 240 tattgttcct caatttttta caatatttat cacaactctg ggagaaaaca aaacaaaacc 300 tatcctattt actatttgtg ctacctagtg aggagatacc gctctgttta gacaaattaa 360 ggcacttcac attcttccac ccaattggaa agttttgtat ccttacagtt tccttttttt 420 taaaataaat ataattttat ttggagncac cttttctatt ctaactaggt cacctgtgga 480 tacaaggtat taaaggtaaa aggtgggggt tgtcctccat tttaaataat ttcaggaaat 540 aaaccaccat ggaaggtaat gnaacttggc catttatctt ttcccccttt tgtaccaaaa 600 tggagggaaa agtggagggc tcacagaaag tttaaattgg cccccagggg tccccaccaa 660 cctaagttca ggtgccagaa 680 50 106 DNA Homo sapiens misc_feature (1)...(106) n=A,T,C or G 50 attctgaaaa tgaggatgtc aatcngcatg ggagcgactc tgagagtgaa gagaccagga 60 aattacctgg tagtgactct gaaaatgagg aacttcttaa tggnct 106 51 668 DNA Homo sapiens misc_feature (1)...(668) n=A,T,C or G 51 tgatgaatgt tggtgcccgg ctctgggtgt tgagcgtaga acagacagca gattcactcc 60 agtgtcttca cacactcatt aggtgttttc tgttccagtg aagtctccag acaagggtat 120 atngaaagac atttttatag gacccagtgt aaagttcagg cccaaatgcc taacatgtct 180 aggtgttttt gtcagtaatg agactggata aatggtatgt aagtgcttgc accccacttt 240 tggaagatta atttaccaag aagcatgggg ttcagttggt gttgctgaca aatatctcaa 300 aaaagccaca aaggtgtctt tgatcgtgac aggtgaggag gtggagaagc gtgctttgag 360 tggtgagcag gtcctgactc gtggctcctg gaggggctgt gtctgctagt gtgcagtctg 420 gggctgttaa aggggcacag tgggacaggc ttgttttcag tggcttanat cttaaaccac 480 aacgtaaaat ctatttctgt taagcagctt ctctgaaaaa ttatnacagc tcanagagcc 540 cttctatgac aagctgtgcc attttaagct cttactttga gggactgaag natancccct 600 gctactgctg gggttcttta tttgggaatc atcccgncta acannaangc ttgaggggng 660 gggggtcc 668 52 708 DNA Homo sapiens misc_feature (1)...(708) n=A,T,C or G 52 ctaagctaaa tatagaataa gctttctaaa ttaaaatggt tttataaaag gagcttgtta 60 gtggggtcat ttttgtactg tgagctttat gtgtaaatgt ctacacaccc acttaacatg 120 tgttgatttc actttagact atgaggaaac cacaggggag tttcaggcca gtcagctttt 180 gatcttcaac tttataactt tcaccttagg atatgacgag cccaccggag tttcaaaaat 240 ggtatcattt tgtatcanac ttgtttttta cactcttggt ttctcacaga gataggtggt 300 ttctccttaa aatcgaacat ttatatgatg cattttactg tagttactat cagaaaagtt 360 agttttccca aatttaagtt cactctgggg tactatagcg tgaatgtagt tcattctgtt 420 gagctagttg ttcatgttag tgtagttcac atatttatct ggaactcaaa aatgaggggt 480 tgagangggg aagctaaaat tcaaaacatg tccaaatata taatttaaat attttacttt 540 atatttaaaa tagaaaagcc aattgattct agaattagac taattgctag cattgcttng 600 catattttaa aaanggaaag cctgaaatgg tttttnaaac tcnttgggaa attttttttn 660 ttggnaaata ggggcccnta aanaaaaaaa aattaaaagg ggggcntt 708 53 245 DNA Homo sapiens misc_feature (1)...(245) n=A,T,C or G 53 tcgaaattaa ccctnctaaa ggnacaaaag ctggancntc gcgccccnng caggtcnaca 60 ctagggnanc caaaaatttc ggnncnaggg ggattgggtc ncnggtnttg ngttggngaa 120 ccctgtngaa cccgacntga caatgcncna nattttcgaa aacantttct gggnaaaaac 180 natcncccta ngggtagacc cnagacacnc catagaatgt caaggnaaaa atccnanata 240 aggaa 245 54 404 DNA Homo sapiens misc_feature (1)...(404) n=A,T,C or G 54 gtcgtaggga acggggtgtt tccctanagg aggcacacac ctttgccagg tgccctacta 60 gacatctctt gctatgcctc ccatccaggt gtagtctcca cagcttccct ttggtttcct 120 cgaggtctct cccttgcaga ttaccacctc ttcttctaag cacctttcct ggtcagatct 180 gtccgcggtt ggggagctag gctgagtcat actcccaggt gccccggctt ctccatgtct 240 tgagaaccag tctccctggt gggcagccag agcagctaca gtgggctgga gactggaggg 300 tggctttagg ccccctgagg ttctagctct ccagggtcag gcttgcagaa cggggcagtt 360 taaacctgat cccttggctc attcatgcct catgacgtgg aaag 404 55 406 DNA Homo sapiens misc_feature (1)...(406) n=A,T,C or G 55 gtttaaactn atcattcccc gggctcgagt gcggccgcaa gctttttttt tttttattaa 60 gaaagcatgc aaaaaaacaa aaacaaaaac aaaacaaaac aaaaaaacca aacaagttgt 120 gagcccatac acccccttcc cagcctgcac caatgcaaag cattatgggt ggtataaaga 180 actcaccaaa ccagaaacat ggggttgagg agaggaagcc taggataggt ggagcaacac 240 aaatgacttc tcctttgtcc tgcacctctg ctgtctcccc tatttcctct tataggcagt 300 gacccccaca actgtgcaaa tatcacccaa gacaacagca aggactgtac ttcatgtcca 360 gcaataccgg accaacaata agagtctttc cgtatggagg gatggg 406 56 555 DNA Homo sapiens misc_feature (1)...(555) n=A,T,C or G 56 ntcctggtgg gcagccagag cagctacagt gggctggaga ctggagggtg gctttaggcc 60 ccctgaggtt ctagctctcc agggtcaggc ttgcagaacg gggcagttta aacctgatcc 120 cctctgggcc tccgattccc atgccctcca tgacgtggaa agggaactac ttttattttg 180 ttggtccggt attgctggac atgaagtaca gtccttgctg ttgtcttggg tgatatttgc 240 acagttgtgg gggtcactgc ctataagagg aaatagggga gacagcagag gtgcaggaca 300 aaggagaagt catttgtgtt gctccaccta tcctaggctt cctctcctca accccatgtt 360 tctggtttgg tgagttcttt ataccaccca taatgctttg cattggtgca ggctgggaag 420 ggggtgtatg ggctcacaac ttgtttggtt tttttgtttt gttttgtttt tgtttttgnt 480 tttttgcatg ctttcttaat aaaaaaaaaa aaagcttgcg gccgcactcg agcccgggtg 540 aatgattgag tttaa 555 57 406 DNA Homo sapiens 57 tgtccctggt gggcagccag agcagctaca gtgggctgga gactggaggg tggctttagg 60 ccccctgagg ttctagctct ccagggtcag gcttgcagaa cggggcagtt taaacctgat 120 cccctctggg cctccgattc ccatgccctc catgacgtgg aaagggaact acttttattt 180 tgttggtccg gtattgctgg acatgaagta cagtccttgc tgttgtcttg ggtgatattt 240 gcacagttgt gggggtcact gcctataaga ggaaataggg gagacagcag aggtgcagga 300 caaaggagaa gtcatttgtg ttgctccacc tatcctaggc ttcctctcct caaccccatg 360 tttggttggg agtcttatac cacccataat gctttgcatt ggtgca 406 58 859 DNA Homo sapiens misc_feature (1)...(859) n=A,T,C or G 58 59 354 DNA Homo sapiens misc_feature (1)...(354) n=A,T,C or G 59 tgatcccgnt nncaccagnt gacaggaagt gtggctggtg cggaggtgac agactgttgc 60 agactcaata acattacgag atggaggtcc ctccaggttt tttacagcct ctcttactag 120 ctnctggcat ttatttagct cttccatttg tgttgtaagt aagaactgaa gacctctgca 180 atcacggaac ttctctgaca tagaaagctt gccagtttgt tgcttgtagt tgctggttat 240 ttcatttcgc actcgctgaa ctagctcctc atcaatagta aattctattg ctctgtggat 300 cacatttagc caccaaggag aattagaatg aatctttctt tgaagctcat ggat 354 60 370 DNA Homo sapiens misc_feature (1)...(370) n=A,T,C or G 60 gtttcanatt acacagcnng tgagaggaag tctggctggt cggaggtgac agactgttgc 60 agactcaata acattacgag atggaggtcc ctccaggttt tttacagcct ctcttactag 120 ctggtggcat ttatttagct cttccatttg tgttgtaagt aagaactgaa gacctctgca 180 atcacggaac ttctctgaca tagaaagctt gccagtttgt tgcttgtagt tgctggttat 240 ttcatttcgc actcgctgaa ctagctcctc atcaatagta aattctattg ctctgtggat 300 cacatttagc caccaaggag aattagaatg aatctttctt tgaanctcan ggatggtctn 360 ctncaagang 370 61 770 DNA Homo sapiens misc_feature (1)...(770) n=A,T,C or G 61 gtttcagaga ccagcanttg agaggaagnc tggctggncg gaggtgacag actgttgcag 60 actcaataac attacgagat ggaggtccct ccaggttttt tacagcctct cttactagct 120 tctggcattt atttagctct tccatttgtg ttgtaagtaa gaactgaaga cctctgcaat 180 cacggaactt ctctgacata gaaagcttgc cagtttgttg cttgtagttg ctggttattt 240 catttcgcac tcgctgaact agctcctcat caatagtaaa ttctattngc tctggggatc 300 acattnagcc accaaggaaa atnaaatgaa tctttctttg aagctcatgg atggnctgct 360 gcacaggata taaagcttgc tgggcttcag caacttctgt attacacttg ctcatgtagt 420 gctctcgcag ctgtttggcc tcttcctcaa gtcggccatc acgcaaggta ggtggtatcc 480 ctgggtgcct ggctatcaac aattccatca agttatgggt agcatgaagt ctttgaagtg 540 aatcagtttt gagttttcct ttgtgttcct ccgaggagcg caacacttct ctgtacaatt 600 ctgctgccaa ggcatactca cctttaataa tntgaaatgc ctgctaagcc attgagagca 660 caaactagct ggcgatgtgc tttnttcaca atcaagtcca caattttntt tctggaaaaa 720 aatngtcaan cagncctctt ttcccatttg ngcnattggg gggcnttttt 770 62 130 DNA Homo sapiens misc_feature (1)...(130) n=A,T,C or G 62 ttccccctta tggcctctac agtatgctcg acggccgcca gngtgatgga tatctgcaga 60 attcggctta gcgcggtttt ttgtcgacgn ccattntctc cnnggantgt cccacttctc 120 tccaatcttg 130 63 128 DNA Homo sapiens misc_feature (1)...(128) n=A,T,C or G 63 tnggntngga cngaggagnc aaaaaaaggg gcaggggncn tttanaaccc cganaaggac 60 cgnagncgcn gaccccccca gngtgttttt ttttccccaa cggcnaacac ccgcaangga 120 cnacacgc 128 64 164 DNA Homo sapiens misc_feature (1)...(164) n=A,T,C or G 64 ccaagtaacc ggccgccagt gtgctggaat tcggcttagc gnggncgcgg ccgntntgtn 60 cttcagggcc tgcntatgcc cntgtgggng aacaccagaa annagcaang ggcaccctng 120 naacccaccg accttgcnna ggcngtgana aggacgggga caac 164 65 290 DNA Homo sapiens misc_feature (1)...(290) n=A,T,C or G 65 aaaaancccg cccncacctg ccgggnnttt nagatgggcc ncnctgacgg gaacantgga 60 aaccacnnga ttgaccccaa nccnggctgc aagcnggaac cantaaaaga ttctggacan 120 ngagacnggg gagacctgnc gtccccctcn gncngggggg gcngaaaact ggacancann 180 gagaacccca ggacaagang atnttggtng ggagacatgc cgagatacat tcagatggcg 240 cnggtcnanc tgcatgngac tcgcggaccg ctatnntatn ccccactggg 290 66 125 DNA Homo sapiens misc_feature (1)...(125) n=A,T,C or G 66 gggncttgac gttnactcat cattccccgg gctcgagtgc ggccgcaagc tttttttttt 60 tttttatgat acaataactt taacaaataa aaacaagact aganagcaac ctggnaaanc 120 ttggg 125 67 122 DNA Homo sapiens misc_feature (1)...(122) n=A,T,C or G 67 tttatgatnc tntngcttgg ncccgagctc ggatccacta gtaacggccg ccagtgtgct 60 ggaattcacc atcaccaacc tgcagtatgt tgtggtgtct gtactctggc tgcagactga 120 aa 122 68 115 DNA Homo sapiens misc_feature (1)...(115) n=A,T,C or G 68 cgatgcatgn ntgaacggcc gctttcagtc tgcanccana gtacagactg accgctgagc 60 aataactagc ggcntggtcg gaacaggaga gcgcacgagg gagcttccag gggga 115 69 102 DNA Homo sapiens misc_feature (1)...(102) n=A,T,C or G 69 cgccnagctt ggtaccgagc tcggatccct agtaacggcc gccagtgtgc tggaatttca 60 gtctgcagcc agagtacaga ctgaccgctg agcaataact ag 102 70 173 DNA Homo sapiens misc_feature (1)...(173) n=A,T,C or G 70 ctggaacctc aaccnntcat cttagagaac tgcagggtct gctcaaaccc ttgttcagga 60 ttagcagnct ggaatacctc tattcaggct gcngactagc ctcactcagg tgagacgctc 120 cttaagaaaa agcttgcggc cgcactcgag cccgggtgaa tgattgagtt taa 173 71 150 DNA Homo sapiens misc_feature (1)...(150) n=A,T,C or G 71 ccgtcctnac nncacaggcc ctgngatgga catacngtac tatgaggaat gagctnagga 60 cgactgaccc acggtcatcc actgtgcatg aacttccnac angcacggac agcggacntc 120 cttgtatgat gggggtttnn cccatcgaaa 150 72 171 DNA Homo sapiens misc_feature (1)...(171) n=A,T,C or G 72 taccttntct acnnataggg cncccactnt ctggacaaag acagcctctn tnttaacagc 60 tcccaanccn tccaccacat nggtttcntt ctaaatcaga agccacaaca nccatggggt 120 accacctgaa gaccctcaca ctcaagcaaa gnatctccaa tctccaactt c 171 73 179 DNA Homo sapiens misc_feature (1)...(179) n=A,T,C or G 73 ngnaganctt tngggncagg ntacagtcct ctaccntagc gcngncaagc cacggcanga 60 catcgnngac ttctttntca ccaaaccnna acagcnggtg gggtaccacc tgntcaccac 120 tcncactcaa cttcaccatc tctngatctc catttttcac cagatatggg caagggctc 179 74 164 DNA Homo sapiens misc_feature (1)...(164) n=A,T,C or G 74 ctttnntttn aaaataaacc cccaccgccc taattnnttg gggggggnnt taaacccccg 60 ccaacgggga aaccgcaaac cggtanctaa agggggcccc aatgggggcc gnaattattc 120 cgggggattc cgnaaatata caaggccgna tgtatgnatg gggg 164 75 151 DNA Homo sapiens misc_feature (1)...(151) n=A,T,C or G 75 cnccttncac ncggggacng nctactgagg atttgggctg acctttnngc catgatcana 60 gcttgacatg cttcccctga atanggttct tctgccctnt ngngactacc acaggtgnac 120 ctgcgaaact cnggagatnc ncccnttcta c 151 76 131 DNA Homo sapiens misc_feature (1)...(131) n=A,T,C or G 76 tttttaaaaa nacggggncg gccncccttt ctttaanata aacnttacgg gaagngccgg 60 ggcgccggcc ccacntgggt gggcccttgg gggaattaat tccttggcca agngaaaatt 120 ttccggccgg g 131 77 471 DNA Homo sapiens misc_feature (1)...(471) n=A,T,C or G 77 gcggccgcgt cgacaggcna tccgcctgcc ttggcctccc agggtgttgg gattgcaggc 60 gtgagccact gcgcctggac ctcctccaat atttggccaa gccacgcaat ctttaagctc 120 ccatgtctgt ccatgattgt aggtttataa tttggaactt cggttattta attaattaaa 180 aatattcaga tatatcctgc aaataaatac aaagtttttg tactcagcat ctttagtgca 240 ctttcccaca aggagttgta tgtaaatgaa aaaaaataga taaatgcaag catgtaaatc 300 aactcctatt attaaccatt tacttagctc aagtttattt acacctgaat gaaaagttcc 360 atttgcatca aaaatacatt cggacactgc cttaagagta tacgaagtcg acgcggccgc 420 gaattcgcgg gcnngtcggc acngagaagn angcncggag ncaagcgggc n 471 78 114 DNA Homo sapiens misc_feature (1)...(114) n=A,T,C or G 78 tcgcggcccg cgtcgacaat gaatcaataa ttntgtccga ttagcatatt tattatatgt 60 taagtcttcc tcaatgcctt gntggaggga naagggnggg ncatnaatnt tggg 114 79 795 DNA Homo sapiens misc_feature (1)...(795) n=A,T,C or G 79 tcgcggcccg cgtcgacctg gggagatgtc atagnacaat atttttaggt tgagaataat 60 tgtacatact catgggttac gtagtgaagt tttccataca tgaaacataa agtgatcaca 120 tcacaaccag ggtcactagc acggccagta tcttgggcac gcgccactca gccttgcagg 180 gaaggtggtg cacaggggtg ccgggctgcc tgggtctgtt ggagatgacc ctgggggaca 240 ggcgtcttgt ggggaagata gtgaacggga gtgtggggct gttggggtct gtgtggaggt 300 ggccccaggt ggacgggctt catgcctccc tgtggtcctg atctccatgt ggctcctgat 360 ccctgagttg ttgcgggtct ctgctgggcc ttgccgtagg tagtgtaacc ttcatggcgg 420 cttgctagga ctgtatataa aagaagaata tgatgtatgt ttaataatta ttggcattga 480 tgtatgtcaa atggtactct atgtctattg ggttcagtac agtattttat ttgttttccc 540 tatcattttt acctttgaac atctgggtca gggttagaca gatttaatac agcttattca 600 gctactctta gtgtcatcct gccaaagaca ctccagtcag agagccanag caggtcctgg 660 cccancctcc cttggatgtc ccagggccct gccaccatct gtaaggggta taggacttct 720 ggcggtgacc cgggatgtgg nctgggtcat gccccncaca tctntatgtg caatgagtgt 780 ntcctcagga ccggg 795 80 169 DNA Homo sapiens misc_feature (1)...(169) n=A,T,C or G 80 ggataagnng anaatggggg atcngagtat naacnngagg gggntttcnc ctgngaanna 60 gggttttatg ttgttaatgg ggggggggan cgacccccnt tgggntgggg taaanatgta 120 aagggagtat agggcngnga cnaatatgtt gancccngta agnaggaaa 169 81 145 DNA Homo sapiens misc_feature (1)...(145) n=A,T,C or G 81 agtntgctcc ccgcggtggc ggccgtctag cccgggcgga tcccccgggc tgaaggaatt 60 cggancgagg gcaaaacaca tagcctnccc cttccttgga ctnttcctnt tgaggggtta 120 atnttaccaa ggttcatttn gctnc 145 82 124 DNA Homo sapiens misc_feature (1)...(124) n=A,T,C or G 82 ggtggcgncc gctctagccc gggcggatcc cccgggctgc aggaattcgg cacgagggag 60 agantnagag agagagagag agagagagag agggatngan agnaaaaana aaaaaaaaac 120 ctat 124 83 745 DNA Homo sapiens misc_feature (1)...(745) n=A,T,C or G 83 ccatctgtag gctccttgct cttctgtaaa cctggcacaa ctaacaaatg acatcattca 60 aatgatggat ggggaagggc aggtataaaa ctagtccaag ggggcaaaaa gctactgctg 120 tggcgttaac tcagcaccac gcgtccagca atcactcggc ccattgatgg gtgagggcct 180 tcagcagttg ggagatgtgg ggcgcaggga ggacctccct gaagtcacct atcatttgtg 240 aagccggctg tcatacttgg ctggctttag gccttttcag agaacaggct gtggtgctga 300 gggaatcctg cccagccctt gatctcctct agatcttcct caatcacggc acaactgaca 360 ctgtggaccc aggattctct gacgtgggat cttagctgtg tgtcataggc ctctgcccac 420 tggattccac tggcatcccc aagctgtaac catcaaaatg tctccagttt cctgggggtg 480 gaggggaagg gtgcaaaatc gctcacaacc actgctctgg acagagccct gatccagaag 540 tatagaagct gcttcactga aggggaaagg aactcatact tttaaatcgc cacaaggtct 600 cggatcctgt gccaggtgct ttcacacgcc atctcattcc aaaaatcctt gtgacaataa 660 tccattatct cattttttaa aaggcaaatg ggaatttgag ggctcaaaac agaatcctgc 720 agtcaaataa gctgntcacg cctcn 745 84 796 DNA Homo sapiens misc_feature (1)...(796) n=A,T,C or G 84 ttcaaatgat ggatggggaa gggcaggtat aaaactagtc caagggggca aaaagctact 60 gctgtggcgt taactcagca ccacgcgtcc agcaatcact cggcccattg atgggtgagg 120 gccttcagca gttgggagat gtggggcgca gggaggacct ccctgaagtc acctatcatt 180 tgtgaagccg gctgtcatac ttggctggct ttaggccttt tcagagaaca ggctgtggtg 240 ctgagggaat cctgcccagc ccttgatctc ctctagatct tcctcaatca cggcacaact 300 gacactgtgg acccaggatt ctctgacgtg ggatcttagc tgtgtgtcat aggcctctgc 360 ccactggatt ccagtggcat ccccaagctg taaccatcaa aatgtctcca gtttcctggg 420 ggtggagggg aagggtgcaa aatcgctcac aaccactgct ctggacagag ccctgatcca 480 gaagtataga agctgcttca ctgaagggga aaggaactca tacttttaaa tcgccacaag 540 gtctcggatc ctgtgccagg tgctttcaca cgccatctca ttccaaaaat ccttgtgaca 600 ataatcatta tctcatttta aaggcaatgg aattgaggct canagaacag aaattacctg 660 ngcaagggtc acaaaaactt aaagcttgtt nttacaaccg gggncccctt ncaacaaccc 720 ccnaaaaggc ctttcctccc acaatcncca tgctgccgnt ctatctgcaa acccgaactg 780 catgtccang gacact 796 85 706 DNA Homo sapiens misc_feature (1)...(706) n=A,T,C or G 85 tnnaataatg cggttggtga tggtaaagtt gagggataga tggattggag aaggagagga 60 gaatggaact gatatttatt gagtctctat catgtaccaa gaatcggact tatattagac 120 cattgaggac aggggagcag ggagagggat agatggattg gggttgggga gggcagagag 180 gggccctgtc ctggggctgt atagggaagt gtgtgtggca ggagatgaca cagatggagt 240 agaaggaacc cggctgtgca tctgcgcaca ccctgcacta ccatgctctg gctctgtgac 300 ctcggatagg tcgcataccc tttcacaact cagcatcttc tgttacgtgg cggacagcca 360 tgctggcttc ctagattccg tgggagcctc atgggctaac atcagcatca ctgggcacac 420 agcaagtgtc ctggacatgc agctcgggtc tgcagatana gcggcagcat gggattgtgg 480 gaggagcttg ggtgtgaggg ccntgtanac agcttacttt tgtgaccttg cncangtaat 540 ttctgttctc tgagcctcaa ttccattgcc tttaaaatga gataatgatt attgtcacaa 600 ggattttttg gaatgagatg gcgtgtgaaa ncacctggca caggatcccg anaccttgtg 660 gcgatttaaa aagtattgag tttccttttt cccccccttt tncncg 706 86 628 DNA Homo sapiens misc_feature (1)...(628) n=A,T,C or G 86 gtaatgcagg ttggtgatgg taaagttgag ggatagatgg attggagaag gagaggagaa 60 tggaactgat atttattgag tctctatcat gtaccaagaa tcggacttat attagaccat 120 tgaggacagg ggagcaggga gagggataga tggattgggg ttggggaggg cagagagggg 180 ccctgtcctg gggctgtata gggaagtgtg tgtggcanga gatgacacag atggagtaga 240 aggaacccgg ctgtgcatct gcgcacaccc tgcactacca tgctctggct ctgtgacctc 300 ggataggtcg catacccttt cacaactcac catcttctgt tacgtggcgg acagccatgc 360 tggcttccta gattccgtgg gagcctcatg ggctaacatc ancatcactg ggcacacagc 420 aagtgtcctg gacatgcagc tcgggtctgc agatagagcg gcagcatggg attgtgggag 480 gagcttgggt gtgagggccg tgtanacagc ttanttttgt gaccttgcnc aggtaatttc 540 tgttctctna gcctcaattc cattgccttt aaaatgaaga taatgattat tgncacaagg 600 atttttggan tgnanatggc cgtntgaa 628 87 789 DNA Homo sapiens misc_feature (1)...(789) n=A,T,C or G 87 aggaactgat atttattgag tctctatcat gtaccaagaa tcggacttat attaggacca 60 ttgaggacag gggagcaggg agagggatag atggattggg gttggggagg gcagagaggg 120 gccctgtcct ggggctgtat agggaagtgt gtgtggcagg agatgacaca gatggagtag 180 aaggaacccg gctgtgcatc tgcgcacacc ctgcactacc atgctctggc tctgtgacct 240 cggataggtc gcataccctt tcacaactca gcatcttctg ttacgtggcg gacagccatg 300 ctggcttcct agattccgtg ggagcctcat gggctaacat cagcatcact gggcacacag 360 caagtgtcct ggacatgcag ctcgggtctg cagatagagc ggcagcatgg gattgtggga 420 ggagcttggg tgtgagggcc gtgtagacag cttagttttg tgaccttgcg caggtaattt 480 ctgttctctg agcctcaatt ccattgcctt taaaatgaga taatgattat tgtcacaagg 540 atttttggaa tgagatggcg tgtgaaagca cctggcacag gatccganac cttgtggcga 600 tttaaaagta tgagttcctt tccccttcan tgaagcagct tctatacttc tggatcaggg 660 ctctntccan agccagtcgg gttgttgnac cccaaatttt ttttgcnaac cccccttttc 720 cccccctttt tccccncccc ccccccccaa cgggnnaaaa ccntngggaa naaaccaatt 780 ttttngaan 789 88 392 DNA Homo sapiens 88 acaaagtaaa atagaaccac aaaataatga actgcatgtt cataacatac aaaaatcgcc 60 gcctactcag taggtaacta caacattcca actcctgaat atatttataa atttacattt 120 tcagttaaaa aaatagactt ttgagagttc agattttgtt ttagattttg ttttcttaca 180 ttctggagaa ccgaagctca gctcagccct cttccttatt ttgctcccaa agcctccccc 240 aaatcatcac tccctgcccc ccttaaggct agaggtgagc atgtccctca caattgcaca 300 tgtcaagcca tcagcaaggc gcatcacaca aaaggcacca agacgtgaaa ctttttaaac 360 caaaaggacg aagaaaaaac actttcaaaa aa 392 89 162 DNA Homo sapiens 89 accgagcgtg agcaaggaga agtccgcttc tctgccgtgg ctctctgcaa ggcagcctaa 60 tgctctgtgg gagggacttt gctgatttcc cctcttccct tcaacatgaa aatatatacc 120 cccccatgca gtctaaaatg cttcagtact tgtgaaacac ag 162 90 147 DNA Homo sapiens 90 accgagcgtg agcaaggaga agtccgcttc tctgccgtgg ctctctgcaa ggcagcctaa 60 tgctctgtgg gagggacttt gctgatttcc cctcttccct tcaacatgaa aatatatacc 120 cccccatgca gtctaaaatg cttcagt 147 91 571 DNA Homo sapiens misc_feature (1)...(571) n = A,T,C or G 91 acggtgatcg gtagcaacaa actggaacag atgccgtcca aggaggatgc cattgagcac 60 ttcatgaaat tatatgaaga aaaaaccggg aacgcttggc actccaaaaa tttcacgaag 120 tatcccaaaa agttctaccc cctggagatt gactatggcc aggatgaaga ggcagtgaag 180 aagctgacag taaatcctgg caccaagtcc aagctcccca agccagttca ggacctcatc 240 aagatgatct ttgatgtgga aagtatgaag aaagccatgg tggagtatga gatcgacctt 300 cagaagatgc ccttggggaa gctgagcaaa aggcagatcc aggccgcata ctccatcctc 360 agtgaggtcc agcaggcggt gtctcagggc agcagcgact ctcagatcct ggatctctca 420 aatcgctttt acaccctgat cccccacgac tttgggatga agaancctnc gctcctgaac 480 aatgcagaca gtgtgcaggc caaggtggaa atgctttgac aacctgctgg acatcgangt 540 gggctacagt ctgctcaggg gaaggtcttg a 571 92 422 DNA Homo sapiens 92 tgtttgatcc tctttggttg agcctgaaat gacatcatgg gcataaagcg ctttctaagc 60 tacaagaagc tacacacaca agcaattatc atagcatcat taagatacga tattggatgt 120 atgcaacttc aacttgagaa gggcaaaagc atgaagaaat caaacaggga gggaggaggt 180 gagggaaggg gagacagcaa cttggggcaa aactggggac agccaaattt gtccacgagt 240 catcagcttc tgatcagcct ggcggtgggg ggtcgctaga agagcttctc atagcatttc 300 ccacagtgaa tggtgtcacg gtgaatgaag atctgatcca ggagatcgcc catcggttta 360 ctgcaaatcc cacacttaaa gcaatattca tggcagcaga taccaagatg ttctagggta 420 at 422 93 237 DNA Homo sapiens 93 accaccacag atcctggcct gaacttaata ttggagaggc ccagaaaacc cccttgttta 60 taaatctgca aaaacatttc agaaatgtct ctctgcagct cttggtagta gtcggtgctg 120 ggatcttcca gagaggaatt aaactggagg tttgaaatgt gaaaagacag gaaaaagaaa 180 gagaccccag tagacaactg gggagaagtg ctgtgattgg aggaggtgag aggaggt 237 94 604 DNA Homo sapiens 94 accatcttca ctgggctccc tctttaataa gaaggagaac aaagaagtta ttcttaaact 60 tctggtcata tttgagaaca taaatgataa tttcaaatgg gaagaaaatg aacctactca 120 gaatcaattc ggtgaaggtt cacttttttt ctttttaaaa gaatttcaag tgtgtgctga 180 taaggttctg ggaatagaaa gtcaccatga ttttttggtg aaagtaaaag ttggaaaatt 240 catggccaaa cttgctgaac atatgttccc aaagagccag gaataacacc ttgattttgt 300 aatttagaag caacacacat tgtaaactat tcattttctc caccttgttt atatggtaaa 360 ggaatccttt cagctgccag ttttgaataa tgaatatcat attgtatcat caatgctgat 420 atttaactga gttggtcttt aggtttaaga tggataaatg aatatcacta cttgttctga 480 aaacatgttt gttgcttttt atctcgctgc ctagattgaa atattttgct atttcttctg 540 cataagtgac agtgaaccaa ttcatcatga gtaagctccc ttctgtcatt ttcattgatt 600 taat 604 95 171 DNA Homo sapiens 95 tgcctcacaa tgataggaag agccgacatc gaaggatcaa aaagcgacgt cgctatgaac 60 gcttggccgc cacaagccag ttatccctgt ggtaactttt ctgacacctc ctgcttaaaa 120 cccaaaaggt cagaaggatc gtgaggcccc gctttcacgg tctgtattcg t 171 96 469 DNA Homo sapiens misc_feature (1)...(469) n = A,T,C or G 96 ctatgggtgt ctgtaaatta tgtatgtcag ttggacattg tagaaggtat gtaaatcagc 60 atagttgtgt ataacttaac cttgatttat aaggtcttaa gattatgact attcattgac 120 atctcatgag aagctttaga agactttcta tttttaaaca ccatttatat gtggacttct 180 gttgtcactg actttgggct ttatattttc atagagtctt tatggaaaaa atagaattta 240 ttttccactc ttgtagctat agctgctgca cactttcacc ctgatttatt tttttgtttc 300 ttagctttga tgttttcaaa ccaaggattg tgattttagg ttagaattac atattagaag 360 cattaagact atgtctttgg atcagaatgc cttttctttt tttgcttcag ttgtaaagaa 420 gagggaatac atgataaagt aactggtttg attctcgtca ttgnacctg 469 97 466 DNA Homo sapiens 97 tgcaggggtt ttaacactta caataatagg aaatagccat taaaaagttg ctctaacttt 60 agatttctaa ctttagtgtt ctttaacaaa ggccatattt tgtggcctta aaaacaaaaa 120 attatatctg gctttatcta ttagtaaaca caaagggtcc atattttatt ctgaaaaaat 180 atttattata ttcattcata aatgttctaa ctaatttaac taaaaaaatc ttctagtatt 240 ttctgatgcc acaagcttac tagaaaatta cttctaaaaa ttggtaatat aaatcatcaa 300 tgatttacct actttaaaaa agaggggtat ctgtttctct tacatttaat aacctgaaaa 360 tgagtctata aaaatatttt taaaaaatac agtaacactg ctgagttttg ttaggtccct 420 tgttttttta attttttatt tatttattta tttttagcaa gaatgt 466 98 602 DNA Homo sapiens misc_feature (1)...(602) n = A,T,C or G 98 tgttaggatt ttttcttcag caccattgat gagatatggg tggttgcagc acttgcgcaa 60 ctccatcatt gtgttaagta gattaggcat gttggtatga cctgcccctt tggaaaggaa 120 ggagaaattc ttctccaaaa tagcccgata gtatttcttc tggatattag tcagctctac 180 ttcaataatt gtttcctgtt tgggtgccaa gtttttttca acatcctctt tgagtcttct 240 cagcatcatt ggcttaagaa tggcctgtag cttttgaacc tgttcctctg tcttgagatc 300 cccaaagtcc ttgagaaact ctgattctga gggaaattgt gacggttcca agaaatgaag 360 caagctaaac agttcttcta cagtattttg caatggtgtt cctgtgagta gcaccttgtg 420 ttccaggtcc atgtgcttga gactatcaag cagcttgcaa ttacggtttt tcagtcgatg 480 ggcttcatca atgataacac aacgccattc aatttcacga agctcaggac aatctgacaa 540 aatcatctca aaagtggtga tcagagcgtc aaacttgtat gcgcctggga tgangcgccc 600 cc 602 99 248 DNA Homo sapiens 99 actcaaaaac aaaaattgag gtatttggtt cttctaggag tagacaatga catttgtgaa 60 ggcagacacc tacacaaaaa taaataaggt atattctcat atgtatatgt gtcgtcggga 120 ataatactgg taggtatgtc aagcatgaag agattctaca aaaaacaggg gcgagcaaaa 180 agaaaccgga tcacacactg agtcctttct cttctgcccc attttgtcag atggacttga 240 tgtacctg 248 100 228 DNA Homo sapiens 100 ctggagactc tgggccagga gaagctgaag ctggaggcgg agcttggcaa catgcagggg 60 ctggtggagg acttcaagaa caagtatgag gatgagatca ataagcgtac agagatggag 120 aacgaatttg tcctcatcaa gaaggatgtg gatgaagctt acatgaacaa ggtagagctg 180 gagtctcgcc tggaagggct gaccgacgag atcaacttcc tcaggcag 228 101 138 DNA Homo sapiens 101 caacaaactt gtttatatat atatatatat atacacatac atacacacac agtcacattc 60 cagtgtgaca aatactcctt caccaggaga cagcgctacc actccgggat cttgaagact 120 ctctacagag agcctagg 138 102 511 DNA Homo sapiens 102 actgggtgct gacggctgct catgttgtgg agggaaacag ggagccaaca atgtatgttg 60 ggtccacctc agtgcagacc tcacggctgg caaaatccaa gatgctcact cctgagcatg 120 tgtttattca tccgggatgg aagctgctgg aagtcccaga aggacgaacc aattttgata 180 atgacattgc actggtgcgg ctgaaagacc cagtgaaaat gggacccacc gtctctccca 240 tctgcctacc aggcacctct tccgactaca acctcatgga tggggacctg ggactgatct 300 caggctgggg ccgaacagag aagagagatc gtgctgttcg cctcaaggcg gcaaggttac 360 ctgtagttcc tttaagaaaa tgcaaagaag tgaaagtgga gaaacccaca gcagatgcag 420 aggcctatgt tttcactcct aacatgatct gtgctggagg agagaagggc atggatagct 480 gtaaagggga cagtggtggg gcctttgctg t 511 103 211 DNA Homo sapiens 103 ccaataatca agacaaactg ggatttgagg atggatcagt tctgaaacag tttctttccg 60 aaacagagaa aatgtcccct gaagacagag caaaatgctt tgaaaagaat gaggccatac 120 aggcagccca tgatgccgtg gcacaggaag gccaatgtcg ggtagatgac aaggtgaatt 180 tccattttat tctgtttaac aacgtggatg g 211 104 228 DNA Homo sapiens 104 ctggagactc tgggccagga gaagctgaag ctggaggcgg agcttggcaa catgcagggg 60 ctggtggagg acttcaagaa caagtatgag gatgagatca ataagcgtac agagatggag 120 aacgaatttg tcctcatcaa gaaggatgtg gatgaagctt acatgaacaa ggtagagctg 180 gagtctcgcc tggaagggct gaccgacgag atcaacttcc tcaggcag 228 105 60 DNA Homo sapiens 105 taggtccaca acttcatgca gatgttagag acaaagtcaa aattatcttt aaaaacatgg 60 106 135 DNA Homo sapiens 106 actggatcat gggtgggggg ccacacatca gcaccagcgg ctcctcctct gggggtggaa 60 ggtggtcccg gatcatctcc tcattcacga agccctggcc gtagtcccag gcttcagggg 120 ctctgtccag cgtgt 135 107 580 DNA Homo sapiens 107 ctagacacca aatatagtgt gggaatacac aacctactag cctatgtgaa acacctgaaa 60 ggccagaatg aggaagccct gaagagctta aaagaagctg aaaacttaat gcaggaagaa 120 catgacaacc aagcaaatgt gaggagtctg gtgacctggg gcaactttgc ctggatgtat 180 taccacatgg gcagactggc agaagcccag acttacctgg acaaggtgga gaacatttgc 240 aagaagcttt caaatccctt ccgctataga atggagtgtc cagaaataga ctgtgaggaa 300 ggatgggcct tgctgaagtg tggaggaaag aattatgaac gggccaaggc ctgctttgaa 360 aaggtgcttg aagtggaccc tgaaaaccct gaatccagcg ctgggtatgc gatctctgcc 420 tatcgcctgg atggctttaa attagccaca aaaaatcaca agccattttc tttgcttccc 480 ctaaggcagg ctgtccgctt aaatccagac aatggatata ttaaggttct ccttgccctg 540 aagcttcagg atgaaggaca ggaagctgaa ggagaaaagt 580 108 562 DNA Homo sapiens 108 ctgcagtgag gggccaatat ccctatctcc taacacaggg caggaagggg gttcaaaggg 60 cttttatgaa gacctgccct ctagtttctc atggcagtat ttggaggttc tgatcacagg 120 gttgggcatc ttgatcaaga gttcctcctt gggtttgaag tcacagtggg caaggctgac 180 agcagcaacc ctttcacatg atttaagcgt tagaattata taattctgta cattggggca 240 atctcaaaag tagtaaaatt tttttttgtc ttttggctta accctagaga cacagtagac 300 cagtttcagc ttacatttaa gatggcagaa ggggagaaca aaaaatttga caggaattaa 360 agtgctaaga acatcacctt agaatcaatt ccaaggactt acatgaagag aaaaccatcc 420 tcccatatga aaatatttgc agtaggagaa cacagtgcaa taggctccaa aaatggcttt 480 taagaccttt ggtggggcag ttactactgc tttaaaagcc aggttaaagt atactcttaa 540 gcaaagatga ccgtagagca gc 562 109 354 DNA Homo sapiens 109 tggttgtggt aagactagtc tgaatcgtag aggaagagta atgcctggaa agagacgtcc 60 taatggagtt atcactggcc ttgcagctag gaaaacgact ggaattcgaa aaggaattag 120 tcctatgaat cgtccacctc taagtgacaa gaatatagaa caatattttc cagtgttaaa 180 aaggaaggca aaccttctga gacaaaatga agggcagagg aaaccagtag cagttctcaa 240 gagacctagc cagctaagca gaaaaaataa cattccagct aattttacca ggagtggaaa 300 taaattaaat catcagaaag atactcgtca ggcaactttt cttttcagaa gagg 354 110 217 DNA Homo sapiens misc_feature (1)...(217) n = A,T,C or G 110 ccagttctcc cacgccggat cccaattctt gttttggtgg catattccct gctaggtgtt 60 aacagccatt gattcttcgt gatggaattc ttcacaacag gggaggcctt ggtgaaagct 120 gggtggaaaa ccctagaaag gtagtgtccg gagacacacc agccttgcag ccaacatggt 180 ggnggtgcac caggtctacc cgaacagatc tgaaatg 217 111 147 DNA Homo sapiens 111 accgagcgtg agcaaggaga agtccgcttc tctgccgtgg ctctctgcaa ggcagcctaa 60 tgctctgtgg gagggacttt gctgatttcc cctcttccct tcaacatgaa aatatatacc 120 cccccatgca gtctaaaatg cttcagt 147 112 220 DNA Homo sapiens misc_feature (1)...(220) n = A,T,C or G 112 ctgaacagag agctggcaga acagaaggcc accgaaaagc agcacatcac gttagccttg 60 gagaaacaaa agctggaaga aaagcgggca tttgactctg cagtagcaaa agcattagaa 120 catcacagaa gtgaaataca ggctgaacag gacagaaaga tagaagaagt cagagatgcc 180 atggaaaatg aaatgagaac ccagcttcgn cgacaggcag 220 113 460 DNA Homo sapiens 113 agacatactg agctatgggt cagaagtgtt ttacttaaaa agcaaacaat ccccaggaaa 60 tactgaatag gaaccagcaa cacaaggcca gcttgtgttg tatgtttatt aatacagtct 120 aaaaaaaaaa agcaaaacca caacacacat ccccaaacaa taactctcaa tcacatagct 180 aattgcttca ttattttgta aaactgacat cctaacactg gcacctagaa tacttttcca 240 tctgagtcta acgtacccca ctgcttctaa tacggcccgc actacgatga gctacttcag 300 tgggtgggac caagcaggaa ctgtaaggga aaattagtca ctagttctat tatcgtttta 360 tttttcaaga tgtgtgacag gtacaggtga caatatggtt gccaagtaac ctgctctccc 420 tccctcaaca agacgcaaca tgaaacctgg aagtatgagg 460 114 596 DNA Homo sapiens misc_feature (1)...(596) n = A,T,C or G 114 tatttgtttc ctacagtatt tgatgagaat gagagtttac tcctggaaga taatattaga 60 atgtttacaa ctgcacctga tcaggtggat aaggaagatg aagactttca ggaatctaat 120 aaaatgcact ccatgaatgg attcatgtat gggaatcagc cgggtctcac tatgtgcaaa 180 ggagattcgg tcgtgtggta cttattcagc gccggaaatg aggccgatgt acatggaata 240 tacttttcag gaaacacata tctgtggaga ggagaacgga gagacacagc aaacctcttc 300 cctcaaacaa gtcttacgct ccacatgtgg cctgacacag aggggacttt taatgttgaa 360 tgccttacaa ctgatcatta cacaggcggc atgaagcaaa aatatactgt gaaccaatgc 420 aggcggcagt ctgaggattc caccttctac ctgggagaga ggacatacta tatcgcagca 480 gtggaggtgg aatgggatta ttccccacaa agggagtggg aaaaggagct gcatcattac 540 aagancagaa tgtttcaaat gcatttttag ataagggaga gttttacata ngctca 596 115 486 DNA Homo sapiens misc_feature (1)...(486) n = A,T,C or G 115 accaacgagt aacaaagaaa cagtaaatct tcatcttaac aacctttaat agttatctaa 60 atgcagagtt tgtttatgaa atgaaccaaa gcagtttgtc atttcttact ataaaatatt 120 gaaaatcaag tgcgaaactc agccactatt ggctaaagaa actaaataaa aaacgttaat 180 gacctagaaa agcaaaagcc cattttaaac tacttaaagc ctctgcacta gtccaatgag 240 tcaaaggcaa gggagaagta aacctctaaa actgaagaag accctctaaa ggagaaaact 300 atagaagtta aagtatgcat gcttattata ctatgggaaa ccaagaagga agggagagaa 360 aaagaaaaag cacagtctat tcatccagga catcagtaaa aatctacagt aacctgatcg 420 aaccaaaaat ccttangggg ttgtgaaata cattggtcac ttctngtatc ttaaacttaa 480 atgatt 486 116 562 DNA Homo sapiens 116 acagttaaaa aatcttaata tggaaaatga ccccaataat gaagaaatag atgaggttac 60 acatccaaat ggaaattatg acacagagct ccaagcctta catgaaatgg tccagcagaa 120 agttgttact ttgctaagtg accctgaaaa tattgtaaaa caaaccttga tggaaaatgg 180 aataacacgg ctgtgtgtat tctttggacg tcagaaagcc aacgatgttt tgttgtccca 240 catgattact ttcctaaatg ataagaatga ttggcatcta cgtggagcat tttttgatag 300 tataattggt gttgctgcct atgttggctg gcaaagctcc tcaattctca agcctctgct 360 gcaacaaggt cttagtgatg ctgaggaatt tgtcattgtg aaagctcttt atgcccttac 420 ttgtatgtgc cagttaggac tgctacaaaa accccatgtt tacgaatttg ccagtgatat 480 tgcccccttc ctgtgtcatc ccaatttatg gatacgttat ggtgccgtgg gatttatcac 540 agtggtagct cgtcaaataa gt 562 117 265 DNA Homo sapiens 117 tttttttttt tgctcagcaa tctttattca gttcttcttg ggggtgggat gcctcccttc 60 ccatgctccc acccctccca tcccagaact ccgttgggct cagtgtcctc tgttgaggga 120 aggtcttggt gcccagatgc ctactctgca ggagagggag gaaccttgtc cctttgcggg 180 agtcgctggt ctcttctgtt gtggggaagg aggaaggtgg gagggacact gtccaccagc 240 actcagagct ccattatgtc cccag 265 118 390 DNA Homo sapiens 118 tgcatcagtg caagttgcag gctggatata gtccatgatg tcgatgtgaa tatcactgag 60 aaccacacaa tttctgtcac ttgctgctcc agagacatca taaactatat taccaaaaat 120 tattccattt tctgttgatg ctactttgac gttagcttta atatttgcga agtcatgagg 180 agcaagagtc aaaggagacg gcttttccac aagtttcaga tcccctagtg tagctagttc 240 taatgtgcaa ttctgcaaag tatcactggt ttggttcaca acaagtacag gctgagtttc 300 agccacacca agaagtcaaa gctaaccgag gctgtgcctt ccgagacccc cgggatggcc 360 cctgggaggc caaggagtcg gggactgggt 390 119 651 DNA Homo sapiens misc_feature (1)...(651) n = A,T,C or G 119 gacaaactaa caaacaaaaa ttgttttgct ttgttacaag gtggggaaga ctgaagaagt 60 gttaactgaa aacaggtgac acagagtcac cagttttccg agaaccaaag ggaggggtgt 120 gtgatgccat ctcacaggca ggggaaatgt ctttaccagc ttcctcctgg tggccaagac 180 agcctgtttc agagggttgt tttgtttggg gtgtgggtgt tatcaagtga attagtcact 240 tgaaagatgg gcgtcagact tgcatacgca gcagatcagc atccttcgct gccccttagc 300 aacttaggtg gttgatttga aactgtgaag gtgtgatttt ttcaggagct ggaagtctta 360 gaaaagcctt gtaaatgcct atattgtggg cttttaacgt atttaaggga ccacttaaga 420 cgagattaga tgggctcttc tggatttgtt cctcatttgt cacaggtgtc ttgtgattga 480 aaatcatgag cgaagtgaaa ttgcattgaa tttcaaggga atttagtatg taaatcgtgc 540 cttanaaaca catctgttgt cttttctgtg tttggtcgat attaataatg gcaaaatttt 600 tgcctatcta gtatcttcaa attgtagtct ttgtaacaac caaataacct t 651 120 272 DNA Homo sapiens 120 ctgagggcac gcgctgcact ccgtaactca acatggcatg cctttctctc cgtaaactat 60 ttagtgagat ttttagggac tatttttcag tatctctgta cctgttaaag ggggtgcttt 120 tcgatctaaa aacttaattt tataaaattg acttattttt ctagactaaa attgtatatg 180 cttttggtaa ttaggaactc ttgagaatat tggctgctga ttgttgccat cacgttccta 240 caaaattgtt tttctatggg atgttctggc ag 272 121 307 DNA Homo sapiens 121 ccatcaggag aaaggtgttt gtcagttgtt tcataaacca gattgaggag gacaaactgc 60 tctgccaatt tctggatttc tttattttca gcaaacactt tctttaaagc ttgactgtgt 120 gggcactcat ccaagtgatg aataatcatc aagggtttgt tgcttgtctt ggatttatat 180 agagcttctt catatgtctg agtccagatg agttggtcac cccaacctct ggagagggtc 240 tggggcagtt tgggtcgaga gtcctttgtg tcctttttgg ctccaggttt gactgtggta 300 tctctgg 307 122 113 DNA Homo sapiens 122 aatttttaga cctaaattta aatccaaaag ctgtaaaact tctgtaaaat aacaaaaggg 60 atatttaggt aactttgggt ttggcaatta ctttgtagat acaacagtaa agt 113 123 643 DNA Homo sapiens 123 accttggatt cagccctcaa tgctgcctct tactataact tcacagtctt aaaggttcca 60 agaagcatga ctgatccaca gaatatggag ttccaggttc ctgtaatact tacctcacag 120 gttaatgctc ctctgttggc tggaaacact tgtcagaatg tagtttctca ggtcacctat 180 gagatagaga ccaatgggac ttttggaatc cagaaagttt ctgtcagttt gggacaaacc 240 aacctgactg ttgagccagg cgcttcctta cagcaacact tcatccttcg cttcagggct 300 tttcaacaga gcacagctgc ttctctcacc agtcctagaa gtgggaatcc tggctatata 360 gttgggaagc cactcttggc tctgactgat gatataagtt actcaatgac cctcttacag 420 agccagggta atggaagttg ctctgttaaa agacatgaag tgcagtttgg agtgaatgca 480 atatctggat gcaagctcag gttgaagaag gcagactgca gccacttgca gcaggagatt 540 tatcagactc ttcatggaag gcccagacca gagtatgttg ccatctttgg taatgctgac 600 ccagcccaga aaggagggtg gaccaggatc ctcaacaggc act 643 124 495 DNA Homo sapiens 124 tgtggcacaa tcggacttat tcacgcagtg gccaataatc aagacaaact gggatttgag 60 gatggatcag ttctgaaaca gtttctttct gaaacagaga aaatgtcccc tgaagacaga 120 gcaaaatgct ttgaaaagaa tgaggccata caggcagccc atgatgccgt ggcacaggaa 180 ggccaatgtc gggtagatga caaggtgaat ttccatttta ttctgtttaa caacgtggat 240 ggccacctct atgaacttga tggacgaatg ccttttccgg tgaaccatgg cgccagttca 300 gaggacaccc tgctgaagga cgctgccaag gtctgcagag aattcaccga gcgtgagcaa 360 ggagaagtcc gcttctctgc cgtggctctc tgcaaggcag cctaatgctc tgtgggaggg 420 actttgctga tttcccctct tcccttcaac atgaaaatat atacccccca tgcagtctaa 480 aatgcttcag tacct 495 125 52 DNA Homo sapiens 125 atccaaatta tctgcctctc ctcctttcct cctttttcta aggtcttctg gt 52 126 300 DNA Homo sapiens 126 ctgggcgatg tgcgagctga tagtgagcgg cagaatcagg agtaccagcg gctcatggac 60 atcaagtcgc ggctggagca ggagattgcc acctaccgca gcctgctcga gggacaggaa 120 gatcactaca acaatttgtc tgcctccaag gtcctctgag gcagcaggct ctggggcttc 180 tgctgtcctt tggagggtgt cttctgggta gagggatggg aaggaaggga cccttacccc 240 cggctcttct cctgacctgc caataaaaat ttatggtcca aaaaaaaaaa aaaaaaaaaa 300 127 367 DNA Homo sapiens 127 tctgcagacc ttggcagcgt ccttcagcag ggtgtcctct gaactggcgc catggttcac 60 cggaaaaggc attcgtccat caagttcata gaggtggcca tccacgttgt taaacagaat 120 aaaatggaaa ttcaccttgt catctacccg acattggcct tcctgtgcca cggcatcatg 180 ggctgcctgt atggcctcat tcttttcaaa gcattttgct ctgtcttcag gggacatttt 240 ctctgtttca gaaagaaact gtttcagaac tgatccatcc tcaaatccca gtttgtcttg 300 attattggcc actgcgtgaa taagtccgat tgtgccacag gaattcccaa tggtctgctt 360 catgaag 367 128 631 DNA Homo sapiens 128 ctgccggcaa ccacaggttc caagatggtt tgcgggggct tcgcgtgttc caagaactgc 60 ctgcgcgccc tcaacctgct ttacaccttg gttagtctgc tgctaattgg aattgctgcg 120 tggggcattg gcttcgggct gatttccagt ctccgagtgg tcggcgtggt cattgcagtg 180 ggcatcttct tgttcctgat tgctttagtg ggtctgattg gagctgtaaa acatcatcag 240 gtgttgctat ttttttatat gattattctg ttacttgtat ttattgttca gttttctgta 300 tcttgcgctt gtttagccct gaaccaggag caacagggtc agcttctgga ggttggttgg 360 aacaatacgg caagtgctcg aaatgacatc cagagaaatc taaactgctg tgggttccga 420 agtgttaacc caaatgacac ctgtctggct agctgtgtta aaagtgacca ctcgtgctcg 480 ccatgtgctc caatcatagg agaatatgct ggagaggttt tgagatttgt tggtggcatt 540 ggcctgttct tcagttttac agagatcctg ggtgtttggc tgacctacag atacaggaac 600 cagaaagacc cccgcgcaaa tcctagtgca t 631 129 266 DNA Homo sapiens 129 tttttctgca gaaaagtttt attccttttg aggacaaaca attaaaccac atccaaggtc 60 ttaacttaca gacagaaacc aaagtagcca tttaaagcgt tagatatcgg atacaagaca 120 tacactgggg agaatgcttc accatctgaa gctcgcacca caatggccca gtggacagct 180 gtgcactctg cttgtgctta agtgcctggg tgtggctgag gggaaggcgt gtctgcagaa 240 cagaagaaca gctgtgtttc acaagt 266 130 580 DNA Homo sapiens 130 acattttgga tttctttata taaggtcata gattcttgag ctgttgtggt ttttagtgca 60 cttaatatta gcttgcttaa ggcatacttt taatcaagta gaacaaaaac tattatcacc 120 aggatttata catacagaga ttgtagtatt tagtatatga aatattttga atacacatct 180 ctgtcagtgt gaaaattcag cggcagtgtg tccatcatat taaaaatata caagctacag 240 ttgtccagat cactgaattg gaacttttct cctgcatgtg tatatatgtc aaattgtcag 300 catgacaaaa gtgacagatg ttatttttgt atttttaaaa aacaattggt tgtatataaa 360 gtttttttat ttcttttgtg cagatcactt tttaaactca cataggtagg tatctttata 420 gttgtagact atggaatgtc agtgttcagc caaacagtat gatggaacag tgaaagtcaa 480 ttcagtgatg gcaacactga aggaacagtt accctgcttt gcctcgaaag tgtcatcaat 540 ttgtaatttt agtattaact ctgtaaaagt gtctgtaggt 580 131 78 DNA Homo sapiens 131 agagcgaatg gtcacagttg gtcgctgggt aaagggaatg agtgcaggct atgaagaaat 60 tttggatgta cctcggcc 78 132 285 DNA Homo sapiens 132 tttgtgatgg tcttccaatt tggggtaccc caggccctgt ttggggtgcg ccgtatgctg 60 cctctcatct ggtctaagga gcctggtgtc cgggaagccg tgcttaatgc ctaccgccaa 120 ctctacctca accccaaagg ggactctgcc agagccaagg cccaggcttt gattcagaat 180 ctctctctgc tgctagtgga tgcctcggtt gggaccattc agtgtcttga ggaaattctc 240 tgtgagtttg tgcagaagga tgagttgaaa ccagcagtga cccag 285 133 405 DNA Homo sapiens 133 acagaattat ataattctaa cgcttaaatc atgtgaaagg gttgctgctg tcagccttgc 60 ccactgtgac ttcaaaccca aggaggaact cttgatcaag atgcccaacc ctgtgatcag 120 aacctccaaa tactgccatg agaaactaga gggcaggtct tcataaaagc cctttgaacc 180 cccttcctgc cctgtgttag gagataggga tattggcccc tcactgcagc tgccagcact 240 tggtcagtca ctctcagcca tagcactttg ttcactgtcc tgtgtcagag cactgagctc 300 cacccttttc tgagagttat tacagccaga aagtgtgggc tgaagatggt tggtttcatg 360 tttttgtatt atgtatcttt ttgtatggta aagactatat tttgt 405 134 513 DNA Homo sapiens 134 ctgacttctg actcgagaca atggaaacaa aaaggagttg aatgatgtta aaagtatcca 60 aagaagaaaa ctgccaaccc agaattctat accagtaaaa acagtccttc aaaaatgaaa 120 gcaaaaaaga cattgtcagg taaacaagac tgaaggaatc cattgctggt agacctgcca 180 gttcttcagc ctgaaaataa atgacacgtt ggcagcttga gtctatgaga agaaataaag 240 aacactggaa atggtaaatg catgggttaa ctggaaagac ttaactagtt ctgcagtctg 300 gaagccaacc aaaggctttg aatgatttgt tgggtttata atatccacag ataaaatata 360 tgtcatatta gtacaaaaga tggggaatta aatggagcaa tattggagca aaatttctgt 420 atcttactgg aatcaagtta gcatcagtct tatatagatt gtgattaaga tgcttactgt 480 aatgcctaga gcagctctga acaaaataac ttt 513 135 425 DNA Homo sapiens 135 ctggccggga gcctgatcac ctgccctgct gagtcccagg ctgagcctca gtctccctcc 60 cttggggcct atgcagaggt ccacaacaca cagatttgag ctcagccctg gtgggcagag 120 aggtagggat ggggctgtgg ggatagtgag gtatcgcaat gtaagactcg ggattagtac 180 acacttgttg attaatggaa atgtttacag atccccaagc ctggcaaggg aatttcttca 240 actccctgcc ccccagccct ccttatcaaa ggacaccatt ttggcaagct ctatcaccaa 300 ggagccaaac atcctacaag acacagtgac catactaatt ataccccctg caaagcccag 360 cttgaaacct tcacttagga acgtaatcgt gtcccctatc ctacttcccc ttcctaattc 420 cacag 425 136 473 DNA Homo sapiens 136 tctcaattat gaaatcttgc agagaagtta tttttctttc tcaaaatcca ggtgatgaca 60 atattcctta ctccagatct ggcatttttt catcatcact gtcttgtgaa tcatcatctg 120 ctccatctac ttctggtaaa tctacatcct catcaccacc catgttgttc atcatctcag 180 agaaacgatc aaaattagac atgtcttcat ctgaatcatc ttcccagtct ttccaattat 240 tgaagtcgac actaagccaa ttaagctttg ccctttcttt tgttaacctt ggccatgact 300 ggccagattc tccttttcgt aaacaacata aaattgatct gtccgttctt ttatgcttgg 360 aatcatttgg atcaatacag tgaaaaagat caatttcatt taaatgctta aaattatcac 420 ttcctccgag acaactgaat gtaagtttgg atttttcaaa atttacatta aca 473 137 285 DNA Homo sapiens 137 ccaccaggat gcactttaca acatcaaagt gtatgtggcg ctggatagcc tggaccacct 60 gttcatagaa ccgctccaag gcccggtcat gctgagagca attgcctttc cttttcctag 120 ggatgttcac ctccaccttg gcccgagtga gggtcatgct gggagtgact aagcagatat 180 gggcgaggcc ttcctgcatg accacagccg ccacatcagc gctccaggct gggtcacagg 240 cctgctcgat gcgctccagt accacactat cccactgctt cttgg 285 138 432 DNA Homo sapiens 138 ctgtgactgt gaaaccctca ttttccttgg tctcttctga tcgcaggagg ctggcagatt 60 cccagtggat acggtgggtg atcttggagc tgcggctggg cagttggagg gacacatcaa 120 ggttcagttc ctggtggtca ggggcgtcct tttggtattg agccaaggct tggaacacca 180 tgaaggtggc ctgggtagag ccatagccac caccgtagta tctctgttca ttgagccaac 240 gcacgacggg aggcacaaag tcaaagtctt ttagctgcag tagggccaag agggcatagg 300 atgtggcctc cacgttgtag agctgcttac cagggtcctc ccagcggttc ttatctttgg 360 ctgtggtcag aaatttgtta agaagaggcc ccttcagcct gcccatctgg gccagagcat 420 agccagcaat gg 432 139 241 DNA Homo sapiens 139 actgtgcctt gacctcagca atgatgctgt ccatgtccag ggagcggctg ttgtccatgg 60 acagcaccac agatgtgtcc gagatctggg actgcagctc ccggatctcc tcttcataca 120 gctgcctgag gaagttgatc tcgtcggtca gcccttccag gcgagactcc agctctacct 180 tgttcatgta agcttcatcc acatccttct tgatgaggac aaattcgttc tccatctctg 240 t 241 140 464 DNA Homo sapiens misc_feature (1)...(464) n = A,T,C or G 140 accctccacc tggagaggct tgcctatctg catgccaggc tcagggagtt gctgtgtgag 60 ttggggcggc ccagcatggt ctggcttagt gccaaccccc gtcctcactg tggggacana 120 accttctatg acccggagcc catcctgtgc ccctgtttca tgcctaacta gctaggtgca 180 catatcaaat gcttcattct gcatacttgg acactaaagc caggatgtgc atgcatcttg 240 aagcaacaaa gcagccacag tttcagacaa atgttcagtg tgagtgagga aaacatgttc 300 agtgaggaaa aaacattcag acaaatgttc agtgaggaaa aaaaggggaa gttgggggta 360 ggcagatgtt gacttgagga gttaatgtgc tctttgggga gatacatctt atagagttag 420 aaatagaatc tgaatttcta aagggagatt ctggcttggg aagt 464 141 454 DNA Homo sapiens 141 acccagggtg agttggtaga gaacagagag atgagaagca gagggcttgg ggaaagcctg 60 ttcctctctg actcagccct ttttggcatt attgcaagag cttgactcct ggttgccttt 120 tcccagccag ttttcagttg gggtgaaggt ttctgcaagt gtgaggtcca gatgctgctg 180 ctcatgttgg gctttccttt tgggaactat ttctctttat ttatagtgtc gggcttccgg 240 ggaaagcaat cattggtgtg tatgtgtatg tgcatgcaca cacgtgcata tacacatttg 300 tgtatgtgga aatgtgctgg gcaagtcaaa actatagaag agttgcctcc tgtctctcga 360 atcttccaga gatatcactt aattgttaac agcttttgtg ttaatcccct tcagccccta 420 gctcttttat tctaccacgg ctggagagtt gata 454 142 279 DNA Homo sapiens 142 acacagaggg agaggctagc agtattttta aattggtttc taaatttttt atagcttgat 60 ggtagataac acatttgctt cattgaagta atctgaaaaa ccaatcctca aaagacctct 120 caattagaat tcaccgagcg tgagcaagga gaagtccgct tctctgccgt ggctctctgc 180 aaggcagcct aatgctctgt gggagggact ttgctgattt cccctcttcc cttcaacatg 240 aaaatatata cccccccatg cagtctaaaa tgcttcagt 279 143 234 DNA Homo sapiens 143 ccagcgacct cccggttcaa ttcttcagtc cggctggtga accaggcttc agcatccttc 60 cggttctgct cggccatgac ctcatattgg cttcgcatgt cactcaggat cttggcgaga 120 tcggtgcccg gagcggaatc cacctccaca ctgacctggc ctcccacttg gcccctcagc 180 gtactgattt cctcctcatg gttcttcttc aggtaggcca gctcttcctt cagg 234 144 274 DNA Homo sapiens 144 ctgaggatgc caaggacttc ttcaagagga agatagattt tctaaccaag cagatggaga 60 aaatccaacc agctcttcag gagaagcacg ccatgaaaca ggccgtcatg gaaatgatga 120 gtcagaagat tcggcagctc acagccctgg gggcagctca ggctactgct aaggcctgag 180 agtttttgca gaaatggggc agagggacac cctttgggcg tggcttcctg gtgatgggaa 240 gggtcttgtg ttttaatgcc aataaatgtg ccag 274 145 395 DNA Homo sapiens 145 acaaagtaaa atagaaccac aaaataatga actgcatgtt cataacatac aaaaatcgcc 60 gcctactcag taggtaacta caacattcca actcctgaat atatttataa atttacattt 120 tcagttaaaa aaatagactt ttgagagttc agattttgtt ttagattttg ttttcttaca 180 ttctggagaa ccgaagctca gctcagccct cttccttatt ttgctcccaa agcctccccc 240 aaatcatcac tccctgcccc ccttaaggct agaggtgagc atgtccctca caattgcaca 300 tgtcaagcca tcagcaaggc gcatcacaca aaaggcacca agacgtgaaa ctttttaaac 360 caaaagggac gaagaaaaaa cacttttcaa aaaaa 395 146 603 DNA Homo sapiens misc_feature (1)...(603) n = A,T,C or G 146 ccccgacctt gtagaaaaac ccctaccctc acaatacctt atttaagtaa ctttaaatta 60 tgccgttact tttcatattt gcactaagat atttccaggc tgcatttgta tatttagatt 120 ttttggttaa gctttgacac tggaatgagt tgaaaaaatg tgccattttg cattttcatc 180 tactcattta aagtatttta ttcttattca aagaaatatc tgagctcttt gcactacctg 240 ttatcagtag tgcctttact tcaggcttga taatacttag gtgtgattat aaaatcatga 300 agcaggtaaa gggaggggca agcccccaaa ctgctgtggg gacattttat aatctatatg 360 ctgcacccac ttaatctact gtggtgtttt gtttattagt tttgcataat ttcagcttct 420 atatattgna tgtatatatt ttttaaaaat ctatattttg ggaaaaaaac atacacaatg 480 tgtctttctt tttggacatt tacctttttg aaaaagaaaa cacttaaaat gatcattagg 540 acataacaga ctaggccaga catagcatct tgtggctttg caaccatttt catttggttg 600 gtt 603 147 269 DNA Homo sapiens misc_feature (1)...(269) n = A,T,C or G 147 tttttttttt tttttttttt ttttttttct cttggaccat aaatttttat tggcaggtca 60 ggaaaanagc cgggggtaag ggtcccttcc ttcccatccc tntacccana anacaccctc 120 caaaggacag canaagcccc anagcctgct gcctcaaagg accttggagg canacaaatt 180 gttgtagnga tcttcctgtc cctcgagcag gctgcggtag gnggcaatnt cctgctccag 240 ccgcgacttg atgtccatga gccgctggt 269 148 604 DNA Homo sapiens misc_feature (1)...(604) n = A,T,C or G 148 atatctcttg tatgccaaat atttaatata aatctttgaa acaagttcag atgaaataaa 60 aatcaaagtt tgcaaaaacg tgaagattaa cttaattgtc aaatattcct cattgcccca 120 aatcagtatt tttttttatt tctatgcaaa agtatgcctt caaactgctt aaatgatata 180 tgatatgata cacaaaccag ttttcaaata gtaaagccag tcatcttgca attgtaagaa 240 ataggtaaaa gattataaga caccttacac acacacacac acacacacac acgtgtgcac 300 gccaatgaca aaaaacaatt tggcctctcc taaaataaga acatgaagac ccttaattgc 360 tgccaggagg gaacactgtg tcacccctcc ctacaatcca ggtagtttcc tttaatccaa 420 tagcaaatct gggcatattt gagaggagtg attctgacag ccacgttgaa atcctgtggg 480 gaaccattca tgtccaccca ctggtgccct gaaaaaatgc caataatttt tcgctcccac 540 ttctgctgct gtctcttcca catnctcaca tagaccccag acccggtggc ccctggctgg 600 gcat 604 149 632 DNA Homo sapiens 149 acttaatttg tgcaagagac tgggctatgt gctgggggtg aggtggaaat acaaaaacac 60 caagatgcaa tccctctcaa gaactgtata atctagtaag agcacataca gagatggtgc 120 ttgcaggtaa aaactgctct gaaaccatgg ggagagaaga gtttacttcc ttccagaggg 180 tgaagtcggg acccatttaa atttggtagt atgggtgagg aaggtcataa cacgctaagt 240 aaactggtgt ctaagcatgt gacggcaaca gctaatggtc tagttcctcc atggctttaa 300 atgcatgaaa gggaaaagag tattcaaagg tatttttatt ttatctcatt gttagcccag 360 tataaggcag gatgacaaaa aataaataaa agtatgaaga ggcaagaaca tagattgaaa 420 actccatttc ctagttttag tgtaaactca atcccttgtg catatacatc tagttcctga 480 agtccacact gccaaaaggg aaaaacaaga aaaaccagcc ctagcagtgc cctgtcatca 540 tggcagagca ccgtctcttc tgtgggactg aaacagctag ctttggctac tgacggtagt 600 ggacaatatg gcacatggaa attaaaaagt cc 632 150 398 DNA Homo sapiens 150 gctgagtccc aggctgagcc tcagtctccc tcccttgggg cctatgcaga ggtccacaac 60 acacagattt gagctcagcc ctggtgggca gagaggtagg gatggggctg tggggatagt 120 gaggcatcgc aatgtaagac tcgggattag tacacacttg ttgattaatg gaaatgttta 180 cagatcccca agcctggcaa gggaatttct tcaactccct gccccccagc cctccttatc 240 aaaggacacc attttggcaa gctctatcac caaggagcca aacatcctac aagacacagt 300 gaccatacta attatacccc ctgcaaagcc cagcttgaaa ccttcactta ggaacgtaat 360 cgtgtcccct atcctacttc cccttcctaa ttccacag 398 151 77 DNA Homo sapiens misc_feature (1)...(77) n = A,T,C or G 151 acctttgcta aggttagaat gaataattta ttgtattttt aattngaatg nntgtgcttt 60 ttaaatganc caagact 77 152 126 DNA Homo sapiens 152 aaagcccttt cctcaaaaca gacatttctc cttatcattc ttccttcagg ggaagtattg 60 aaagtcattg aaagtcatcc ccacccacta gaggagagaa acgcaagccc agaaaaggca 120 agaggt 126 153 359 DNA Homo sapiens 153 accccacagc atagtcagtg ttgctgaatc tgtcaggact tgcatcctcc tgacctcgga 60 acaggggact gtgatggcct gtggtcccac gccaggctcc atgaaaatcg taggtcatga 120 tgctaatgaa atccaggtgt tgggatatct tggcaatgtc atagctgctg tcaatggtga 180 ccttccccgc agacagtgct gcgctgagca ggagctgctt tttccctggc tgggcttcct 240 ttataaattc ggccttcatt tccttgatta gggtggtaaa atgctgtttg tctctccgtc 300 cagggtagag ccaggcaagg tccagcccat caaagccatg ggtgcgcaga aatggcggt 359 154 543 DNA Homo sapiens 154 ctttggtggt agcaggaaca tggggggacc atatggtgga ggaaactatg gtccaggagg 60 cagtggagga agtgggggtt atggtgggag gagccgatac tgagcttctt cctatttgcc 120 atgggcttca ctgtataaat aggagaggat gagagcccag aggtaacaga acagcttcag 180 gttatcgaaa taacaatgtt aaggaaactc ttatctcagt catgcataaa tatgcagtga 240 tatggcagaa gacaccagag cagatgcaga gagccatttt gtgaatggat tggattattt 300 aataacatta ccttactgtg gaggaaggat tgtaaaaaaa aatgcctttg agacagtttc 360 ttagcttttt aattgttgtt tctttctagt ggtctttgta agagtgtaga agcattcctt 420 ctttgataat gttaaatttg taagtttcag gtgacatgtg aaaccttttt taagattttt 480 ctcaaagttt tgaaaagcta ttagccagga tcatggtgta ataagacata acgtttttcc 540 ttt 543 155 266 DNA Homo sapiens 155 tgtgagctca tctgcctgag gactggagaa ctggcagaga agccaacgga gttattgggt 60 ccttcattca taccctgtaa agatgttact gggatgtttg aataagatcg gttattatta 120 ttacaggcac tacttgtcat gccaacggga gagcttaatg tcgaaatatt aggaggaaag 180 gaattgttag gcatcctggg ccttgttgcc aatccagaag taacctgtct ccttggagac 240 atgaactgtg ctagggatat tcctgt 266 156 652 DNA Homo sapiens misc_feature (1)...(652) n = A,T,C or G 156 acgcttatgg tagtgggtga atctggattg ggaaagtcga cattaatcaa ctcattattc 60 ctcacagatt tgtattctcc agagtatcca ggtccttctc atagaattaa aaagactgta 120 caggtggaac aatccaaagt tttaatcaaa gaaggtggtg ttcagttgct gctcacaata 180 gttgataccc caggatttgg agatgcagtg gataatagta attgctggca gcctgttatc 240 gactacattg atagtaaatt tgaggactac ctaaatgcag aatcacgagt gaacagacgt 300 cagatgcctg ataacagggt gcagtgttgt ttatacttca ttgctccttc aggacatgga 360 cttaaaccat tggatattga gtttatgaag cgtttgcatg aaaaagtgaa tatcatccca 420 cttattgcca aagcagacac actcacacca gaggaatgcc aacagtttaa aaaacagata 480 atgaaagaaa tccaagaaca taaaattaaa atatacgaat ttccagaaac agatgatgaa 540 gaagaaaata aacttgttaa aaagataaag gaccgtttac ctcttgctgt ggtaggtagt 600 aatactatca ttgaagttaa tggcaaaagg gtcagangaa ggcagtatcc tt 652 157 142 DNA Homo sapiens 157 atcaaaacta ggcagattta agaatttatt taaccacaaa gaatgctcaa aactattatt 60 caacaggaat caagcccaaa ccctggagtt gactgctgac cgtattcggt ttgggctttt 120 cccagaatgg aaacactttt cc 142 158 570 DNA Homo sapiens 158 ctggtgctgt tacagaaggc agtgcaggag gcttccaacc agagcatctg cggagaagga 60 ggcacagcag gtgcctgaag ggaagcaggc aggagtccta gcttcacgtt agacttctca 120 ggtttttatt taattctttt aaaatgaaaa attggtgcta ctattaaatt gcacagttga 180 atcatttagg cgcctaaatt gattttgcct cccaacacca tttcttttta aataaagcag 240 gatacctcta tatgtcagcc ttgccttgtt cagatgccag gagccggcag acctgtcacc 300 cgcaggtggg gtgagtcttg gagctgccag aggggctcac cgaaatcggg gttccatcac 360 aagctatgtt taaaaagaaa attggtgttt ggcaaacgga acagaacctt tgatgagagc 420 gttcacaggg acactgtctg ggggtgcagt gcaagccccc ggcctcttcc ctgggaacct 480 ctgaactcct ccttcctctg ggctctctgt aacatttcac cacacgtcag catctaatcc 540 caagacaaac attcccgctg ctcgaagcag 570 159 633 DNA Homo sapiens 159 tttttttttt tttttttaac attattggta gctttattaa atttgtttac cttctaaaaa 60 yaaacgattac aaaaaagaat acttcattta agtgtaatac tggctttatg gacgtaccgt 120 gatcagaaag tgaaattaaa gctcatggat atgcgtgaga agagaatggg cgcagaggca 180 cgagtccagt atcccacgga gagaaggaag tgtagagaga tgcgtggacc catctcaggg 240 gtcacgcatt cctgggccaa ggagttgctt ctaagagctt aaaataaatg cactggctgg 300 ctccagggct ggacacacag gcacagaact cttgtaccgc ttctcaaagc tgcccaaagc 360 tgcgaccttc cccctcagac acccacaatt aattgtttcc aatcagcttt ggttttgttt 420 tctctgcaat ttacatgcaa acaccatcta tctattcatc catacatcca tccatccatc 480 catccatcca tcctgcttgt ctcctttgac cagccaagcc ctattttgtt ttacgtacat 540 tcccttcact aaaaatcaaa gcaagccagt ctacatcaag ccaagaaacc tacttttcta 600 aggagagaaa ccagtgcttt gccttcttta atg 633 160 78 DNA Homo sapiens 160 atctccgagc tgaggatgcc ttgtttttct ttgtcaacaa tgtcattcca cccaccagtg 60 ccacaatggg tcagctgt 78 161 303 DNA Homo sapiens misc_feature (1)...(303) n = A,T,C or G 161 tttttttttt tttttttttt ttttggacca taaattttta ttggcaggtc agganaanag 60 ccgggggtaa gggtcccttc cttcccatcc ctctacccan aanacaccct ccaaaggaca 120 gcanaagccc canagcctgc tgcctcanag gaccttggag gcagacaaat tgttgtagng 180 atcttcctgt ccctcgagca ggctgcggta ggnggcaatc tcctgctcca gccgcgactt 240 gatgtccatg agccgctggt actcctgatt ctgccgctca ctatcagctc gcacatcgcc 300 can 303 162 310 DNA Homo sapiens misc_feature (1)...(310) n = A,T,C or G 162 acattgcggc tccctccggt tctgctgctg acaaagttgt gactgaggcc tgcgacgaac 60 tgggaatcat cctcgctcat acgaaccttc ggctcttcca ccactgattt taccacacac 120 tgttttttgg cttgcttatg tgtaggtgaa cagtcacgcc tgaaactttg aggataactt 180 tttaaaaaaa taaaacagta tctcttaatc actggaaaaa aaaaaaaaaa aaaaaaaaaa 240 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 300 aaaaancccc 310 163 222 DNA Homo sapiens 163 agaatcagaa taagacctag gatggcttca aggacacatg gacaaggagc tgagtagtga 60 ctgagtaaga aataagtgaa gaagctgaag gaagagccga atgaggtgtc tgtgaagagc 120 actttagcca acagcccagc ccagatctgc agctctgcta gatggctgac tggagaaacc 180 ttccagccaa gccctaagat cccatcacat cctggatcat gt 222 164 131 DNA Homo sapiens misc_feature (1)...(131) n = A,T,C or G 164 acaagctttt tttttttttt tttttttttt tttttttttc tttatcanca aaaaagtttt 60 attccttttg aggacaaaca attaaaccac atncaaggnc ttaacttaca nacagaaacc 120 aaatnaccca t 131 165 542 DNA Homo sapiens 165 gatacagtcg acttgtctgc agtagatgtc catccaaacg tatcaatagg tgtttttatt 60 gagcaaccaa ccccttttct acctcggttt ctggacatat tgttgacact ggattaccca 120 aaagaagcac ttaaactttt tattcataac aaagaagttt atcatgaaaa ggacatcaag 180 gtattttttg ataaagctaa gcatgaaatc aaaactataa aaatagtagg accagaagaa 240 aatctaagtc aagcggaagc cagaaacatg ggaatggact tttgccgtca ggatgaaaag 300 tgtgattatt actttagtgt ggatgcagat gttgttttga caaatccaag gactttaaaa 360 attttgattg aacaaaacag aaagatcatt gctcctcttg taactcgtca tggaaagctg 420 tggtccaatt tctggggagc attgagtcct gatggatact atgcacgatc tgaagattat 480 gtggatattg ttcaagggga atagagtagg agtatggaat gtcccatata tggctaatgt 540 gt 542 166 73 DNA Homo sapiens 166 agagaaaaag ccttctttcc cccaattgct ggagaaaaga tccgatctcc acaagcactc 60 ctttggagaa aaa 73 167 277 DNA Homo sapiens 167 acacttcctt agggcagaaa caccatccta tcaggtttgg tcagtccctt cttcatgaag 60 ggagtcatgg ggaattcctg aaaattttct tccttctgca gacagttgga tgagtccctt 120 agagaaggca tccagagaca taactaaact gaatatcatc ccatattgat tttaggaatt 180 gactctaaaa ctctgtgcag aatcttgtgt tgggattgta tcttgacatt cctgttgtgt 240 tatttttctt aactggagtg tgtgctgcct ttcaggt 277 168 315 DNA Homo sapiens 168 tcaaactgaa gatcatgcct tggcacctgt gaggaacact attcaactcc caacacaacc 60 tttgaattca gcgtggtcgc ggccgaggtc cagtaatttg catctggata gctaaaaagc 120 tttcaatagg gctttattca acaaatagtg ggtacctact cttcgtggat tggaaatgac 180 tacaaattct ttgaagctcc ttccatcaaa aggtggggtt tatttccctc gattctgggc 240 tagctttcaa gtcaaaagaa gatcttgtac tttctgttct caccctcttc aatctcagat 300 ctcagcctgg gctgg 315 169 30 DNA Artificial Sequence PCR primer 169 ataagaattc gagcggccgc ccgggcaggt 30 170 29 DNA Artificial Sequence PCR primer 170 ccggaattca gcgtggtcgc ggccgaggt 29 171 30 DNA Artificial Sequence PCR primer 171 ccggaattcc agcgtggtcg cggccgaggt 30 172 31 DNA Artificial Sequence PCR primer 172 ccggaattca cagcgtggtc gcggccgagg t 31 173 496 DNA Homo sapiens 173 gggaaagcac tatggtgtgt gtggacaaca gtgagtatat gcggaatgga gacttcttac 60 ccaccaggct gcaggcccag caggatgctg tcaacatagt ttgtcattca aagacccgca 120 gcaaccctga gaacaacgtg ggccttatca cactggctaa tgactgtgaa gtgctgacca 180 cactcacccc agacactggc cgtatcctgt ccaagctaca tactgtccaa cccaagggca 240 agatcacctt ctgcacgggc atccgcgtgg cccatctggc tctgaagcac cgacaaggca 300 agaatcacaa gatgcgcatc attgcctttg tgggaagccc agtggaggac aatgagaagg 360 atctggtgaa actggctaaa cgcctcaaga aggagaaagt aaatgttgac attatcaatt 420 ttggggaaga ggaggtgaac acagaaaagc tgacagcctt tgtaaacacg ttgaatggca 480 aagatggaac cggttc 496 174 497 DNA Homo sapiens misc_feature (1)...(497) n = A,T,C or G 174 ggtgttggcg gtctggctca gctgggcagg gggtaacttt actgatttgg gggtggtttt 60 tagtttaatt tttcttttct agcttcccat cgacggtcag tgcgcacgtt gtaatcagct 120 gaggccatgt caggagacgg agccacggag caggcagctg agtatgtccc agagaaggtg 180 aagaaagcgg aaaagaaatt agaagagaat ccatatgacc ttgatgcttg gagcattctc 240 attcgagagg cacagaatca acctatagac aaagcacgga agacttatga acgccttgtt 300 gcccagttcc ccagttctgg cagattctgg aaactgtaca ttgaagcaga gattaaagct 360 aaaaattatg acaaggttga aaagctattt cananatgcc ttatgaaggt tttgacattg 420 atttatggaa gtgttatctt tcatatgtcc cgaaaaacca agggtaaact cccaagtacc 480 aaanaaaaaa atggctc 497 175 498 DNA Homo sapiens misc_feature (1)...(498) n = A,T,C or G 175 ggagagagag agagagagag agagagagag agagagagag agagagagag agagagagag 60 agagagagag agagagagag agangggcac ctccccaagg tgcagatgcc ctgtttgaan 120 atgcccaaag tggccctcaa gggcccccag gtggatgtca agggccccaa gctggacctg 180 aaaggcccca aggcggatgt gatgaccccc gtcgtggagg tgtctctgcc cagcatggag 240 gtggacgtcg aggccccggg agccaagctg gacagtgcgc ggctggaggg tgacctgtcc 300 ctagccgaca aggacatgac tgncaaagac agcaaagttc aaaaatgccc nagtttcaan 360 atgcccgtcg ttcnggggtg tcttgccccc cnngccaang tcccttcgan gnccntcngg 420 tngggattgt tgtcttgccc cccnaaaggt ggganggctt tgaccnttga agcccttncc 480 ccttccnttg ccgggggg 498 176 498 DNA Homo sapiens 176 gcttgcgcgc tggcactccg cagcctttaa ggttcgcgcg ggggccaggc aagagttagc 60 catgaagagc ctcaagtccc gcctgaggag gcaggacgtg cccggccccg cgtcgtctgg 120 cgccgccgcc gccagcgcgc atgcagcaga ttggaataaa tatgatgacc gattgatgaa 180 agcagcagaa aggggggatg tagaaaaagt gacctcaatc cttgctaaaa agggggtcaa 240 tccaggcaaa ctagatgtgg aaggcagatc tgtcttccat gttgtgacct caaaggggaa 300 tcttgagtgt ttgaatgcca tccttataca tggagttgat attacaacca gtgacactgc 360 agggagaaat gctcttcacc tggctgctaa gtatggacat gcattgtgcc tacaaaaact 420 tctacagtac aattgtccca ctgagcatgc agacctgcag ggaagaactg cacttcacga 480 tgcccgcaat ggcagatt 498 177 626 DNA Homo sapiens misc_feature (1)...(626) n = A,T,C or G 177 ggcgggctcg agccgatgcc cgattccgcg cccgccatgg ccgacaaaat ggacatgtct 60 ctggacgaca tcattaaact gaaccggagc cagcgaggcg gccggggcgg gggccggggc 120 cgcggccggg ccggctccca gggcggccgc ggcggtgggg cgcaggccgc cgcgcgagtg 180 aatcgaggcg gcgggcccat ttcggaaanc ngnggggcca tcgccccggc gcggccggcg 240 gaggcggcag gaaccgaccg gcgccctaca gcaggccaaa acaacttccc gacaagtggc 300 agcacgatct tttcnacagt ggcttcggcg gtggtgccgg cgtggagaca ggtgggaaac 360 tgctggtgtc caatctggat tttggagtct cagacgccga tattcaggaa ctctttgctg 420 aatttggaac gctgaagaag gcggctgtgc actatgatcg ctctggtcgc agcttangaa 480 cagcagacgt gcactttgag cggaaggcag atgccctgaa gccatgaagc agtacaacgg 540 cgtccctctg gatggccgcc ccatgaacat tcagcttgtc acgtcacaga ttgacgcaca 600 gcggaggcct gcacagagcg taaaca 626 178 497 DNA Homo sapiens 178 gacggctcag accgaggcgc acaggctcgc agctccgcgg cgcctagcgc tccggtcccc 60 gccgcgacgc gccaccgtcc ctgccggcgc ctccgcgcgc ttcgaaatga gggtcctggg 120 tgggcgctgc ggggcgctgc tggcgtgtct cctcctagtg cttcccgtct cagaggcaaa 180 ctttttgtca aagcaacagg cttcacaagt cctggttagg aagcgtcgtg caaattcttt 240 acttgaagaa accaaacagg gtaatcttga aagagaatgc atcgaagaac tgtgcaataa 300 agaagaagcc agggaggtct ttgaaaatga cccggaaacg gattattttt atccaaaata 360 cttagtttgt cttcgctctt ttcaaactgg gttattcact gctgcacgtc agtcaactaa 420 tgcttatcct gacctaagaa gctgtgtcaa tgccattcca gaccagtgta gtcctctgcc 480 atgcaatgaa gatggat 497 179 496 DNA Homo sapiens misc_feature (1)...(496) n = A,T,C or G 179 gctgatgtgg aggtttctct gcccagcgtg gaggtggacg tcgaggcccc aggagcaaag 60 ctggatggtg cgcggctgga gggggacctg tccctggccg acaaggacat gacggccaaa 120 gacagcaagt tcaaaatgcc caagttcaag atgccgtcgt tcggggtgtc tgccccaggc 180 aagtccatgg aggcatcagt ggatgtgacc gcgccaaagg tggaggccga cgtgagcctc 240 ccttccatgc agggggacct caaggccact gacctcagcg ttcagccccc ttccgctgac 300 ctggaggtcc aggctggcca agtggacgtg aaactcccag agggccccgt gcccgaggga 360 gccagcctca aagggcacct gcccaaggtg cagatgccca gtttcaagat gcccaaagtg 420 gacctcaagg gcccccagat agatgttaag ggccccaact tggacctgaa aggccccaag 480 gcggaagtga cangcc 496 180 438 DNA Homo sapiens 180 ttggagatcg acctggactc catgagaaat ctgaaggcca gcttggagaa cagcctgagg 60 gaggtggagg ccccgctacg ccctacagat ggagcagctc aacgggatcc tgcttgcacc 120 ttgagtcaga gctggcacag acccgggcag agggacagcg ccaggcccag gagtatgagg 180 ccctgctgaa catcaaggtc aagctggagg ctgagatcgc cacctaccgc cgcctgctgg 240 aagatggcga ggactttaat cttggtgatg ccttggacag cagcaactcc atgcaaacca 300 tccaaaagac caccacccgc cggatagtgg atggcaaagt ggtgtctgag accaatgaca 360 ccaaagttyt gaggcattaa gccagcagaa gcagggtacc ctttggggag caggaggcca 420 ataaaaagtt cagagttc 438 181 499 DNA Homo sapiens 181 ggggttgtcc tttgcatctg cacgtgttcg cagtcgtttc cgcgatgctg cctctgctgc 60 gctgcgtgcc ccgtgtgctg ggctcctccg tcgccggcct ccgcgctgcc gcgcccgcct 120 cgcctttccg gcagctcctg cagccggcac cccggctgtg cacccggccc ttcgggctgc 180 tcagcgtgcg cgcaggttcc gagcggcggc cgggcctcct gcggcctcgc ggaccctgcg 240 cctgtggctg tggctgcggc tcgctgcaca ccgacggaga caaagctttt gttgatttcc 300 tgagtgatga aattaaggag gaaagaaaaa ttcagaagca taaaaccctc cctaagatgt 360 ctggaggttg ggagctggaa ctgaatggga cagaagcgaa attagtgcgg aaagttgccg 420 gggaaaaaat cacggtcact ttcaacatta acaacagcat cccaccaaca tttgatggtg 480 aggaggaacc ctcgcaagg 499 182 702 DNA Homo sapiens 182 gccagatgat ggaggagcgt gccaacctaa tgcacatgat gaaactcagc atcaaggtgt 60 tgctccagtc ggctctgagc ctgggccgca gcctggatgc ggaccatgcc cccttgcagc 120 agttctttgt agtgatggag cactgcctca aacatgggct gaaagttaag aagagtttta 180 ttggccaaaa taaatcattc tttggtcctt tggagctggt ggagaaactt tgtccagaag 240 catcagatat agcgactagt gtcagaaatc ttccagaatt aaagacagct gtgggaagag 300 gccgagcgtg gctttatctt gcactcatgc aaaagaaact ggcagattat ctgaaagtgc 360 ttatagacaa taaacatctc ttaagcgagt tctatgagcc tgaggcttta atgatggagg 420 aagaagggat ggtgattgtt ggtctgctgg tgggactcaa tgttctcgat gccaatctct 480 gcttgaaagg agaagacttg gattctcagg ttggagtaat agatttttcc ctctacctta 540 aggatgtgca ggatcttgat ggtggcaagg agcatgaaag aattactgat gtccttgatc 600 aaaaaaatta tgtggaagaa cttaaccggc acttgagctg cacagttggg gatcttcaaa 660 ccaagataga tggcttggaa aagactaact caaagcttca ag 702 183 499 DNA Homo sapiens misc_feature (1)...(499) n = A,T,C or G 183 gccgagccat gaacaccgaa atgtatcaga cccccatgga ggtggcggtc taccagctgc 60 acaatttctc catctccttc ttctcttctc tgcttggagg ggatgtggtt tccgttaagc 120 tggacaacag tgcctccgga gccagcgtgg tggccataga caacaagatc gaacaggcca 180 tggatctggt gaagaatcat ctgatgtatg ctgtgagaga ggaggtggag atcctgaagg 240 agcagatccg agagctggtg gagaagaact cccagctaga gcgtgagaac accctgttga 300 agaccctggc aagcccagag cagctggaga agttccagtc ctgtctgagc cctgaagagc 360 cagctcccga atccccacaa gtgcccgang cccctggtgg ttctgcggtg taaagtggct 420 ctgtcctcan ggtgggcana gccactaaac ttgntttacc tanttctttc cantttgntt 480 ttgggttccc aagcattat 499 184 480 DNA Homo sapiens misc_feature (1)...(480) n = A,T,C or G 184 ggcttttcga gggacctggc acgtncatcc cccggaagga agtggaggtc gtggagatca 60 ttcaggccac catcatcang cagaaccang ctntgcngnt canggcccgc aaggagtgct 120 gggaccggga cggcaaggag agggtgacag gggaagaatg gctggtcacc acagtagggg 180 cgtacctccc agcggtgttt gaggaggttc tggatttggt ggacgccgtc atccttacgg 240 aaaagacagc cctgcacctc cgggctcggc ggaacttccg ggacttcagg ggagtgtccc 300 gccgcactgg ggaggagtgg ctggtaacag tngcaggaca cagaggccca cgtgccagat 360 gtccacgagg aggtgctggg ggttgtgccc atcaccaccc tggcccccac aactactgcg 420 tgattctcga ccctgtcgga ccggatggca agaatcagct ggggcagaag cgcgtggtca 480 185 611 DNA Homo sapiens misc_feature (1)...(611) n = A,T,C or G 185 ggngtgttga gagcggtgtg gcaggtgttg tngccgttat ggtgaagttc gctttgtagc 60 ggccccggct agagagttgt tctgttccct gcctttgtga cccggaggag cttttggggt 120 gcgtcaagcc cctggcctga ggcagcgaac tggtttgtgg cctgtttgat tcctgtcaga 180 ggtttgctga cccaagacag tatcgaaaat gcatattaag tcaattattc tagagggatt 240 caagtcctat gctcagagga ccgaagtcaa tggttttgac cccctcttca atgctatcac 300 tggcttaaat ggtagtggga aatccaacat attggactcc atctgctttt tgctgggcat 360 ctccaacctg tctcaggttc gggcttctaa tttacaagat ttagtttaca aaaatgggca 420 ggctggtatt accaaagcct ctgtgtcaat cacttttgat aattctgaca aaaagcaaag 480 tcctttanga tttgaggttc atgatgaaat cacagtaaca aggcaggtgg ttatttggtg 540 gtaagaaata aatatttaat caatggagtc aatgccacaa cacccagaag tacaggaact 600 cttctgttct g 611 186 215 DNA Homo sapiens 186 gctttcacaa agacactatc aggccatagc tcagaaagaa ctctgcgtac aacttctttg 60 aaaatctctg ctccaaaact tcaagctttc tagtaaatta ggactccttt acgaatgctt 120 accttgctta aagtatacct tgaaaagtag gacaaaagca tctcccttgc atttaacccc 180 accatcatac tgacactgct catttacaaa atgaa 215 187 500 DNA Homo sapiens misc_feature (1)...(500) n = A,T,C or G 187 gcgatgtccg acgaggaagc gaggcagagc ggaggctcct cgcaggccgg cgccgtgact 60 gtcagcgacg tccaggagct gatgcggcgc aaggaggaga tagaagcgca gatcaaggcc 120 aactatgacg tgctggaaag ccaaaaaggc attgggatga acgagccgct ggtggactgt 180 gagggctacc cccggtcaga cgtggacctg taccaagtcc gcaccgccag gcacaacatc 240 atatgcctgc agaatgatca caaggcagtg atgaagcagg tggaggaggc cctgcaccag 300 cttgcacgct cgcgacaagg agaagcaggc ccgggacatg gctgaggccc acaaagaggc 360 catgagccgc aaactgggtc aagaatgaga gccanggccc tccacnggcc ttcgccaaag 420 tgaacagcat tcagccccng nttcccaacc aagcatcgcg ggtctgcaag tgggattgaa 480 tgaanatttg tggaagtttc 500 188 974 DNA Homo sapiens 188 gggagaggca gcgtctgcgg gaggcaggcc ttgtggccca gcacccgcct gccaggcgct 60 cgggggccga actggccctg gactacctct gcagatgggc ccaaaagcac aagaactgga 120 ggtttcagaa gacgaggcag acgtggctcc tgctgcacat gtatgacagt gacaaggttc 180 ccgatgagca cttctccacc ctgctggcct acctggaggg gctgcagggc cgggcccgag 240 agctgacggt gcagaaggcg gaagccctga tgcgggagct ggatgaggag ggctctgatc 300 cccccctgcc ggggagggcc cagcgcatcc gacaggtgct gcagctgctc tcctagtggg 360 ttcagcgcgg ggcggggccg ctgcccagtg cagggctgcc tcagaccaca cagggtgcag 420 ctcctccggc ggtgggggcc gggttcacca gcagggcagc ggctgagcaa gggctttcag 480 ctcctccggt ggtgggggcc gggatcacca gcaccagagc ctcgcaaggg ccccttccct 540 cctccagacc ctccttggcc ggtgacggct gtgacagtga tggcaggttc agtgccttca 600 gcgcagagcg tggatgctct ggaatcaccc ggacccctgg ccttggaggg accctccagc 660 cccaggaatc tgctttggag ggaaatgtct atttttctac cgggaatatt ttagagattg 720 gggcatgctg gctcctcccg ccagctgcaa acctgcacct tccgcctgat tcccgatccc 780 cctgcgtggg ccgcattcct ggtcccctgc ctgcgtccat cgaggggcct ggctgtggcc 840 tgttttcctt tgaccccaca cagcgtcatt gcgggtcatg gggagcccct ggtgggagct 900 tgtggagtcg gatcacgtac ctgtgcagaa accgcctctg tggctgcatt tgaaataaaa 960 cccgacccag cagc 974 189 499 DNA Homo sapiens misc_feature (1)...(499) n = A,T,C or G 189 gcttcagtgt cgccgccatc tttgttgatg tgtcagctcc gcaggggttt ggggaaacgg 60 ccgctgagtg aggcgtcggc tgtgtttctc accgcggtct tttcctccca ctcttggctg 120 gttggacccc gctatggaaa agttggcccc tgagccagag ctccagcagc cttgttaggg 180 cgtggcctga ggcttggata agtgggatgt aaaacgaaga tcaggagcag atttgaagaa 240 ttacaaagtg aattggtgcc agtcagcatg tcagagacag accacatagc ctctacttcc 300 tctgataaaa atgttgggaa aacacctgaa ttaaaggaag actcatgcaa cttgttttct 360 ggcaatgaaa gcagcaaatt agaaaatgan gtccaaacta ttgtcattaa acactgataa 420 aactttatgt caacctaatg agcataataa atcgaattga anccccanga aaaattatat 480 tccanancat nggtggaag 499 190 571 DNA Homo sapiens misc_feature (1)...(571) n = A,T,C or G 190 tgaatccaat tctttgcagt cacaatttga taaagtttcc tgtagtgaaa gtcagttaca 60 aagccagtgt gaacaaatga aacagccaaa tattaatttg gaaagtaggt tgttgaaaga 120 ggaagaactg cgaaaagagg aagtccaaac tntgcaagct gaactcgctt gtagacaaac 180 agaagttaaa gcattgagta cccaggtaga agaattaaaa gatgagttag taactcagag 240 acgtaaacat gcctytagta tcaaggatyt caccaaacaa cttcagcaag cacgaagaaa 300 attagatcag gttgagagtg gaagctatga caaagaagtc agcagcatgg gaagtcgttt 360 tagttcatca gggtccctga atgctcgaag cagtgcagaa gatcgatytc cagaaaatac 420 tgggtcctca gtagctgtgg ataactttcc ncaagtagat aaggccatgt tgattgagag 480 aatagttagg ctgcaaaaag cacatgcccg gaaaaatgaa aagatagaat ttatggagga 540 ccncatcaaa caactggtgg aagaaattag g 571 191 518 DNA Homo sapiens misc_feature (1)...(518) n = A,T,C or G 191 ggaaagaact tgaggctgcc agtgctccag aggagaggac aaggcttcat gatgaactgg 60 aagaagccaa ggacaaagcc cggcggagat ccnttggcaa catcaagttt attggagaac 120 tctttaaact caaaatgctg actgaagccn tcatgcatga ctgngtggtg aagctgctaa 180 agaaccatga tgaagaatcc ctggagtgcc tgtgtcgcct gctcaccacc attggcaaag 240 acttggactt tnaaaaanca aagccacnta tggaccagta ctttaatcag atggagaaaa 300 ttgtgaaaga aagaaaaacc tcatctanga ttcggntcat gcttcaagat gttatagacc 360 taaggctgtg caattgggta tntcnaanag cagatcaagg gcctaaaact atcgaacaga 420 ttnacaaaga agctctnatn gannaacaag aagagctaag ganggtccat ccactcatga 480 ccnaaganaa gagaanaccn cgtgttcnga gagtggac 518 192 478 DNA Homo sapiens misc_feature (1)...(478) n = A,T,C or G 192 gggaagacgg agctggctgc cngntccana ggcccatgag gggatgcagt tatgggctct 60 gtcgccgtgg attgttattt tgtgtcagta agtaatccat aaagtgccaa catgggaaag 120 aaacggacaa agggaaaaac tgttccaatc gatgattcct ctgaaacttt agaacctgtg 180 tgcagacaca ttagaaaagg attggaacaa ggtaatttga aaaaggcttt agtgaatgtg 240 gaatggaata tctgccaaga ctgtaagact gacaataaag tgaaagataa agctgaagaa 300 gaaacagaag aaaagccttc agtttggctg tgtcttaaat gtggccatca gggctgtggc 360 agaaattctc aggagcagca tgccttgaag cactatctga cgccaagatc tgaacctcac 420 tgtctggttc ttagtttgga caactggagt gtatgggngt taccntatgc gataatga 478 193 566 DNA Homo sapiens misc_feature (1)...(566) n = A,T,C or G 193 ttttcgcgca gccagccttg ggaagcccag gnnnggtntn catggnggtg gaaggaggaa 60 tgaaatgtgt gaagttcttg ctctacgtcc tcctgctggc cttttgcgcc tgtgcagtgg 120 gactgattgc cgtgggtgtc ggggcacagc ttgtcctgag tcagaccata atccaggggg 180 ctacccctgg ctctctgttg ccagtggtca tcatcgcagt gggtgtcttc ctcttcctgg 240 tggcttttgt gggctgctgc ggggcctgca aggagaacta ttgtcttatg atcacgtttg 300 ccatctttct gtctcttatc atgntggtgg aggtggcccg cagccattgc tggctatgtg 360 tttagagata aggngatgtc aagagtttaa taacaacttc cggcangcag atggagaatt 420 accccggaan aacaaccaca ctgcttcgat cctggacagg atgcagggca gaattttaan 480 ngctggtggg gnctggtaac tacacaggat tgggagaaaa atccccttcc atgtcggaag 540 gaaccgnatc ccccgactcc ctgctg 566 194 497 DNA Homo sapiens 194 gctctgatac tcctaccacc cttgccagcc atagcaccaa gactgatgcc agtagcactc 60 yaccatagcac ggtacctcct ctcacctcct ccaatcacag cacttctccc cagttgtcta 120 yctggggtctc tttctttttc ctgtcttttc acatttcaaa cctccagttt aattcctctc 180 ytggaagatcc cagcaccgac tactaccaag agctgcagag agacatttct gaaatgtttt 240 ytgcagattta taaacaaggg ggttttctgg gcctctccaa tattaagttc aggccaggat 300 yctgtggtggt acaattgact ctggccttcc gagaaggtac catcaatgtc cacgacgtgg 360 yagacacagtt caatcagtat aaaacggaag cagcctctcg atataacctg acgatctcag 420 yacgtcagcgt gagtgatgtg ccatttcctt tctctgccca gtctggggct ggggtgccag 480 ygctggggcat cgcgctg 497 195 503 DNA Homo sapiens misc_feature (1)...(503) n = A,T,C or G 195 gcgcccgccg ccgccagccg ggcttncncg ggagagcagg gaagagaaac tttgcctttt 60 attgttttta gtccttaagt gcaaggaact ctgtgttggg aggaaaaatg tccttcttca 120 atttccgtaa gatcttcaag ttggggagcg agaagaagaa gaagcagtac gaacacgtga 180 agagggacct gaaccccgaa gacttttggg agattatagg agaactgggc gacggagcct 240 ttgggaaagt gtacaaggcc cagaataaag agaccagtgt tttagctgct gcaaaagtga 300 ttgacactaa atctgaagaa gaacttgaag attacatggt agagattgac atattagcat 360 cttgtgatca cccaaatata gtcaagcttc tagatgcctt ctattatgag aacaatcttt 420 ggatcctcat tgaattttgt gcaggtggag cagtagatgc tgtgatgctt gaacttgaga 480 gaccattaac tgagtcccaa ata 503 196 665 DNA Homo sapiens 196 gcctcgtgcc gaatcggcac gagggagacg ggggagcggg gggctcgtct gttccaggag 60 ccctgaacca aagagcagcg gagtttgaga agccagcagc tcggggttcg gcagcagcgg 120 tcccatcggc tgaagttcgg ggggggtggg gcgccgagcg cgcggggtgg ggggggtcct 180 ggtctttggc ttctcgactc ggtcctgttt cgacagcgaa catgtcgcgg cctgtcagaa 240 ataggaaggt tgttgattac tcacagtttc aggaatctga tgatgcagat gaagattatg 300 gaagagattc gggccctccc actaagaaaa ttcgatcatc tccccgagaa gctaaaaata 360 agaggcgatc tggaaagaat tcacaggaag atagtgagga ctcagaagac aaagatgtga 420 agaccaagaa ggatgattct cactcagcag aggatagtga agatgaaaaa gaagatcata 480 aaaatgtgcg ccaacaacgg caggcggcat ctaaagcagc ttctaaacag agagagatgc 540 tcatggaaga tgtgggcagt gaggaagaac aagaagagga ggatgaggca ccattccagg 600 agaattccgg cagcgatgaa gatttcctaa tggaagatga tgacgatagt gactatggca 660 gttcg 665 197 497 DNA Homo sapiens misc_feature (1)...(497) n = A,T,C or G 197 gcgccagcag ccgtccggag ccagccaacg agcggaaaat ggcagacaat ttttcgctcc 60 atgatgcgtt atctgggtct ggaaacccaa accctcaagg atggcctggc gcatggggga 120 accagcctgc tggggcaggg ggctacccag ggggcttcct atcctggggc taccccgggc 180 aggcaccccc aggggcttat cctgacaggc acctccaggc gcctaccctg gagcacctgg 240 agcttatccc ggagcacctg cacctggagt ctacccaggg ccacccagcg gccctggggc 300 ctacccatct tctggacagc caagtgcccc cggagcctac cctgccactg gcccctatgg 360 cgcccctgct gggccactga ttgtgcctta taacctgcct ttgcctgggg gagtggtgcc 420 tcgcatgctg ataacaattc tgggcacggt gaagcccaat gcaaacagaa ttgntttaga 480 tttccaaaga nggaatg 497 198 498 DNA Homo sapiens misc_feature (1)...(498) n = A,T,C or G 198 ggtcttccgt gtgtcgccgc cgttgccgcc tcagcttcag cctcgttact cctgcggttc 60 tgtggctgtg ctgctggcgt taacggcgcg aggtgaaggg aggtgacgga gtgtgcccgc 120 gcgcgcgggg gtcccctcag tcccagcagt tcccttcgtg cgcggggggc ggcgagggtc 180 ttcagcagtc gggagaggcc cttgacggcg ccatgtcggc gggcggtcca tgcccagcag 240 cagccggagg gggcccaggg ggcgcctcct gctccgtggg ggcccctggc ggggtatcca 300 tgttccggtg gctggaggtg ctggagaagg agttcgacaa agcttttgtg gatgtggatc 360 tgctcctggg agagatcgat ccanaccaag cggacatcac ttatgagggg cgacagaaga 420 tgaccancct gagctctgnt ttgcacagnt ttccacaaac ccantctgtg tctcaaatca 480 accacaanct ggaggcac 498 199 236 DNA Homo sapiens misc_feature (1)...(236) n = A,T,C or G 199 ggttggggga ggacgggttg ccgactcgcc tacctagcgg tctcttgatt gtcgatattt 60 tgttggcata ggtttatgta gagacgtata catatatata gacacactgt ctttaaatct 120 aggcctgtat ccggtgtccg aggcgaactc agtaagatga tgttaagagg aaacctgaag 180 caagtgcgca ttgagaaaaa cccggnccgc cttcgcgccc tgagtccgnn gtgggc 236 200 498 DNA Homo sapiens misc_feature (1)...(498) n = A,T,C or G 200 ggaacaaagg ggctgctccc tgcactgtca gaacttctct ggactccagc tgctaacacc 60 acatcactca acctacttta ctgcacttta cccccctccc ctgtctgttt ctattgtcca 120 ggctgaagaa atatggggac aaatcatact gatgttttca agaccacaga tgtcccatat 180 tcacaaatct gtttagttac cattaattat gaccccaatt ctgaattttt cccagctcaa 240 ttaccaccca caaaacaacc aatatgaaaa aaaaacttta cctttgatta aatttattcc 300 cttcagcang cctgatcctt tggccccctt gagccgactg ggccatctgg gatgggaagc 360 acgtgatacc agcttggacc aaagttgggg ttgcaatcaa tcnaattntt gagaattgaa 420 aagggnttna atggttatct agggggtttt cttcnattgg atnccccagg cccnattaaa 480 acggggnggn ccncaact 498 201 238 DNA Homo sapiens 201 caccccaccc atacttaaag agcaaataaa ttatatttcg aaaagatttt aaaataatct 60 caatataaaa accaacagtg ggaactgagt ctggaataaa tgtgtgcaga atcctaattc 120 ttttctgagc tccactcagt gagaatatgc accctctact gtccagcctg tggtttgcag 180 taagctcact cttctgttca gaatgaccac aggagggcct ttacctagga atagcagg 238 202 516 DNA Homo sapiens 202 gcttaccagc tcctcagaag gcggacaccg accctgagaa cttacctgag atttcgtcac 60 agaagacaca aagacacatc cagcggggac cacctcacct gcagattaga cccccaagcc 120 aagacctgaa ggatgggacc caggaggagg ccacaaaaag gcaagaagcc cctgtggatc 180 cccgcccgga aggagatccg cagaggacag tcatcagctg gaggggagcg gtgatcgagc 240 ctgagcaggg caccgagctc ccttcaagaa gagcagaagt gcccaccaag cctcccctgc 300 caccggccag gacacagggc acaccagtgc atctgaacta tcgccagaag ggcgtgattg 360 acgtcttcct gcatgcatgg aaaggatacc gcaagtttgc atggggccat gacgagctga 420 agcctgtgtc caggtccttc agtgagtggt ttggcctcgg tctcacactg atcgacgcgc 480 tggacaccat gtggatcttg ggtctgagga aagaat 516 203 634 DNA Homo sapiens 203 gaaaaaacta tagaactaat aaaatgaatt ccataaaaag tgctacgtac aaaattgaca 60 attaagtctg tagcatttct atacactaac aaacaacaat ctatccaaaa aagtaatcag 120 gaaactaatc acatttacac tatttactaa gtaaacaaac ttatgaatga atttaaccaa 180 gtaggttaaa gatctgtata gtgaaaacat tgatgaaacg aattgaagat tacaaaaata 240 gaaatttttc tttgtacatg gattgggaaa tttatattgt tataattttc atactgccaa 300 aagaaacata cagattctgt gtaatcaata tttcaatgtc ctttttattt ttattttttt 360 tacagaaata caaaaaaaat tctaaaactt gtatggaact acaaaagaca ttaaataatc 420 aaagtaatct tgagcaacaa ggacacagct ggaggcatca ctctacctga ttttgtaatc 480 cattatcaaa aaactgtggt accagcataa aaacagattc actgaccagt ggaacaggaa 540 agcccagaca taactcatac atttccatcc aattgatttt tgacaaaagt ggcacacaga 600 atggggagag gataaataaa taaataaatt gttt 634 204 428 DNA Homo sapiens 204 gcatctagat ctttcactgg gttatttgga gagaaacatc tcctaccgaa ggccagttta 60 acaagactgg tagaggaggt tatcttatct aattcacaga aatcaacaca ttgagtcaag 120 aaaaatgaaa cacagaaaca ctctaaataa aagaagataa gtcttcagaa accaacccta 180 atgaaatgaa gatatgtgat taacctgaca ggaaattaaa aagagcaacc ataaagagtc 240 tcatgggagc aatgcatgaa caaactgaga attttcataa agaaataaaa gataacaaaa 300 agtataaagt tgaagaatac aataactgga ctcaaaaatt taatgtgtcc aacagaagaa 360 tcgatgaagc agaagaaagg gctatcaaac ttgtagatag gccattggaa atcatgtaat 420 gtgaggag 428 205 684 DNA Homo sapiens 205 gcaggtctaa aggagaaccc agggattctg ggaccctcca gagccaggag gccaaggctg 60 tgaaaaagac cagtctcttt gaggaagacg aagaagatga tctttttgcc attgccaagg 120 acagccaaaa gaagacccag agagtgtcac tcctctttga agacgatgtt gatagcggag 180 gctctctgtt tggctctcct cccacatctg ttcctcctgc aacaaagaaa aaagagactg 240 tctctgaggc accacctttg ctgttcagcg atgaagaaga gaaggaggca caacttggag 300 tgaagtctgt ggataagaag gttgagagtg ccaaggagtc attaaaattt gggagaactg 360 atgtggctga gtcagaaaag gaaggacttt tgactagatc tgctcaggaa acagtcaagc 420 attctgattt attttcttca tcatccccat gggacaaagg aaccaagcct agaaccaaaa 480 ctgttcttag cttgtttgat gaggaagagg ataaaatgga agatcaaaac attatccagg 540 ctccacagaa agaagtagga aagggccgcg atcctgatgc ccaccccaag agcacaggtg 600 tcttccagga tgaagagctg cttttcagcc acaagctcca aaaggacaat gacccagatg 660 ttgacccttt tttgcttggc accc 684 206 596 DNA Homo sapiens misc_feature (1)...(596) n = A,T,C or G 206 ctttgatgat cccctgaatg cctttggagg ccagtagagc acacagggta tccacatgtt 60 accctgcagc ttacattgtt gagttagtga tgatgttgta tatgctgatg gtcttaactg 120 gattacaaaa agcaaatact agaacagcta gctcatcgtt cacccaatgt acttggtatt 180 tttctgcact ggtttaatca tgcttaatac tacaaaacaa aaataaatat ttcacagtgg 240 ttggtttgtt ttgtttttaa accacagttt gatttagtta gccttgctgg ggccataata 300 tgcttcaggg tgtgtaaaag aagaaatctc tttgtggctt tcatgggcag ggaatcccag 360 agatagcaaa tgccacctga ccagaagtct ttgttatatg gatgggaacc ctaacttagg 420 gcctgggcag gggaaagaga aagaaggtga gagattatac ttcatgagtn ttagcaatat 480 gggagcaggt tttcactgaa ttntgagggt gcctctgcat gtcctccaag gcaaagtttg 540 gcaaactgtg gcccccccca ctgtcatatt ttgttaataa aattttattg gaacnc 596 207 518 DNA Homo sapiens misc_feature (1)...(518) n = A,T,C or G 207 ggaacgctgc ggccagcagc taccccatgg cctccctgta cgtgggcgac ctgcattcgg 60 acgtcaccga ggccatgctg tacgaaaagt tcagccccgc ggggcctgtg ctgtccatcc 120 gggtctgccg cgatatgatc acccgccgct ccctgggcta tgcctacgtc aacttccagc 180 agccggccga cgctgagcgg gctttggaca ccatgaactt tgatgtgatt aagggaaagc 240 caatccgcat catgtggtct cagagggatc cctctttgag aaaatctggt gtgggaaacg 300 tcttnatcaa gaacctggac aaatctatag ataacaaggc actttatgat actttttctg 360 cttttggaaa catactgtcc tgcaaggtgg tgtgtgatga gaacggctct aagggttatg 420 cctttgtcca cttcgagacc caagaggctg ccgacaaggc catcgagaag atgaatggca 480 tggctcctca atgaccgcaa agtatttgtg ggcagatt 518 208 612 DNA Homo sapiens misc_feature (1)...(612) n = A,T,C or G 208 gattacgagc tcccccagcc agcagtcagt attggattgg ccttgcctgg ctggtggcag 60 tttgggagaa acagccaaag aggttagttt atttcaacac caaattagac catggcccat 120 ttccaacggg tctctttaaa ggcctctatg gaatacattg cctggtttcc ttctttgcag 180 ttcaccacag caaggaatgt cacctccatc ccaagccacc attttctcat gaaggcaaat 240 ccaagaaggg cttgcagttc ttgctgaagg gggtaccatt tgtgggcaga gtgatcaata 300 ccatctagtt gggggaggag gagcttattt cttggtgtac ttgaatcaga aggtccctgc 360 aagccagtat gcttcatttg ccagtggcca gaaattctcc cttgcctcct tgattgaggt 420 gtcccagatg tagtattccc acaggggtct ggcaggcccc tcctgtaacc actccagtca 480 cattttctgc tcttgaggca gaggtgacat caggacgttt acagcctcca catgaattga 540 gtgttcattt acctcagtat taccgtgttc atttttgtcc tcgtgctaca gttagctncc 600 tgccgctctt gc 612 209 611 DNA Homo sapiens misc_feature (1)...(611) n = A,T,C or G 209 gttggtgcga ggagccgcgg ggctgtgctc ggcggtnaag gggacagcgc gtgggtggcc 60 gaggatgctg cggggcggta gctccggcgc ccctcgctgg tgactgctgc gccgtgcctc 120 acacagccga ggcgggctcg gcgcacagtc gctgctccgc gctcgcgccc ggcggcgctc 180 caggtgctga cagcgcgaga gagcgcggcc ctcaggagca aggcgaatgt atgacaacat 240 gtccacaatg gtgtacataa aggaagacaa gttggagaag cttacacagg atgaaattat 300 ttctaagaca aagcaagtaa ttcaggggct ggaagctttg aagaatgagc acaattccat 360 tttacaaagt ttgctggaga cactgaagtg tttgaagaaa gatgatgaaa gtaatttggt 420 ggaggagaaa tcaaacatga tccggaagtc actggagatg ttggagctcg gcctgagtga 480 ggcacaggtt atgatggctt tgtcaaatca cctgaatgct gtggagtccg agaagcanaa 540 actgcgtgcg caggttcgtc gtctgtgcca ngaaaatcaa tggctacngg atgaactggc 600 caacacgcag c 611 210 504 DNA Homo sapiens misc_feature (1)...(504) n = A,T,C or G 210 ggtcgttttc ccgtccccga gagcnagttt atttacaaat gttggagtaa taaagaaggc 60 agaacaaaat gagctgggct ttggaagaat ggaaagaagg gctgcctaca agagctcttc 120 agaaaattca agagcttgaa ggacagcttg acaaactgaa gaaggaaaag cagcaaaggc 180 agtttcagct tgacagtctc gaggctgcgc tgcagaagca aaaacagaag gttgaaaatg 240 aaaaaaccga gggtacaaac ctgaaaaggg agaatcaaag attgatggaa atatgtgaaa 300 gtctggagaa aactaagcag aagatttctc atgaacttca agtcaaggag tcacaagtga 360 atttccagga aggacaactg aattcaggca aaaaacaaat agaaaaactg gaacaggaac 420 ttaaaaggtg taaatctgag cttgaaagaa gccaacaagc tgcgcagtct gcagatgtct 480 ctctgaatcc atgcaataca ccac 504 211 611 DNA Homo sapiens misc_feature (1)...(611) n = A,T,C or G 211 gtngaaaaga tgttctctat tagactgtga cctcaannng tcacagcaga aaataaatga 60 gctccttaaa cagaaagatg tgctaaatga ggatgttaga aacctgacat taaaaataga 120 gcaagaaact cagaagcgct gccttacaca aaatgacctg aagatgcaaa cacaacaggt 180 taacacacta aaaatgtcag aaaagcagtt aaagcaagaa aataaccatc tcatggaaat 240 gaaaatgaac ttggaaaaac aaaatgctga acttcgaaaa gaacgtcagg atgcagatgg 300 gcaaatgaaa gagctccagg atcagctcga agcagaacag tatttctcaa ccctttataa 360 aacacaagtt agggagctta aagaagaatg tgaagaaaag accaaacttg gtaaagaatt 420 gcagcagaag aaacaggaat tacaggatga acgggactct ttggctgccc aactggagat 480 caccttgacc aaagcagatt ctgagcaact ggctcgttca attgctgaag aacaatattc 540 tgatttggaa aaagagaaga tcatgaaaga gctggagatc aaagagatga tggctagaca 600 caaacaggac t 611 212 516 DNA Homo sapiens 212 gccaggattc ccgatccaga gacaatggcc ccgatgggat ggagcccgaa ggcgtcatcg 60 agagtaactg gaatgagatt gttgacagct ttgatgacat gaacctctcg gagtcccttc 120 tccgtggcat ctacgcctat ggttttgaga agccctctgc catccagcag cgagccattc 180 taccttgtat caagggttat gatgtgattg ctcaagccca atctgggact gggaaaacgg 240 ccacatttgc catatcgatt ctgcagcaga ttgaattaga tctaaaagcc acccaggcct 300 tggtcctagc acccactcga gaattggctc agcagataca gaaggtggtc atggcactag 360 gagactacat gggcgcctcc tgtcacgcct gtatcggggg caccaacgtg cgtgctgagg 420 tgcagaaact gcagatggaa gctccccaca tcatcgtggg tacccctggc cgtgtgtttg 480 atatgcttaa ccggagatac ctgtccccca aataca 516 213 268 DNA Homo sapiens misc_feature (1)...(268) n = A,T,C or G 213 tgcagaagag aaaagttcct gaagtgacag agaaaaagaa caaaaagctg aagaaggcgt 60 cagcagaggg gccactgctg ggccctgagg ctgcaccaag tggcgaagga gccggctcca 120 agggcgaagc tgtgctcagg cccgggctgg acgcagagcc agagctgtcc ccagaggagc 180 agagggtcct gnaaaggaag ctgaaaaagg aacggaagaa agaggagagg cngcgtctgc 240 gggaggcang ccttgtggcc cagcaccc 268 214 138 DNA Homo sapiens misc_feature (1)...(138) n = A,T,C or G 214 gccagaatgc accccttgtg tcccgtgccc ctcagacctt caagatggat gacctcctgg 60 ctgagatgca gcagattgag cagtcaaact tccgccaggc tccccagaga gcccctggtg 120 tggcagactt gnccctct 138 215 517 DNA Homo sapiens misc_feature (1)...(517) n = A,T,C or G 215 gcaagcgcac acgggaagac gggccctgga ttccttacca gtactacagt ggttcctgtg 60 agaacaccta ctccaaggca aaccgcggct tcatcaggac aggaggggac gagcagcagg 120 ccttgtgtac tgatgaattc agtgacattt ctcccctcac tgggggcaac gtggcctttt 180 ctaccctgga aggaaggccc agcgcctata actttgacaa tagccctgtg ctgcaggaat 240 gggtaactgc cactgacatc agagtaactc ttaatcgcct gaacactttt ggagatgaag 300 tgtttaacga tcccaaagtt ctcaagtcct attattatgc catctctgat tttgctgtag 360 gtggcagatg taaatgtaat ggacacgcaa gcgagtgtat gaagaacgaa tttgataagc 420 tggtgtgtaa ttgcaaacat aacacatatg gagtagactg tgaaaagtgt cttcctttct 480 tcaatgaccg gccgtggagg aggggcaact gcnggaa 517 216 516 DNA Homo sapiens misc_feature (1)...(516) n = A,T,C or G 216 gcatgcgcac atgggccccc ttcgtctcag ctgtgcggga acggccgagg gtaacatccc 60 gggctcgcgg gaggctgtcg gggtaatggc cacacgctga cagaaccagc cgagtggaaa 120 aggggagcga agccgttcct ctgcaccctt ccccaggcct gaggccttcc cgcttggtgc 180 tgccgccgcc actgccggct gaggaggggc gatgagttgg ttcaacgcct cccagctctc 240 cagcttcgct aagcaggccc tgtcccaggc ccagaagtct attgacaggg ttctggacat 300 ccaggaagag gagccgagca tctgggccga gaccattccg tatggagagc cgggaataag 360 ttcccctgtc agtggaggat gggatacttc aacctggggg ttgaaatcaa acactgaacc 420 tcanagtcca ccaatagcct atcctaaagc aatcacaaag ccagttcnga ngactgnggn 480 cnatgaatct gaaaatttct tcagngcctt tctctc 516 217 612 DNA Homo sapiens misc_feature (1)...(612) n = A,T,C or G 217 gangagatgc ggcgcaggag ctgtcgctgt gtttgcttta acctgagtct tgttccttat 60 tgtggttcct gctgtggttt tgatcatgtt gttaccctcg gacgtagccc ggcttgtatt 120 gggttactta cagcaagaaa acctcatttc tacctgccag acttttattt tggaaagttc 180 agatttaaaa gaatatgcag aacattgtac agatgaaggg tttattccag cctgcttact 240 gtccttattt ggaaaaaact tgacaacaat tttaaatgag tatgtagcta tgaaaacaaa 300 agaaacatca aataatgtcc cagcaataat gtcatctcta tggaagaaat tggaccatac 360 actttctcag atcaggagca tgcaaagttc cccaaggttt gctggcagtc agagagcccg 420 aacgagaact ggaattgcag aaatcaaacg gcagagaaag cttgcatctc aaacagctcc 480 agccagtgca gagttgctca ctttacctta cctttcagga cagtttacca ctcctccttc 540 cacaggtaca caggttactc gaccaagtgg ccaaatttca gatccatcga ggtcatattt 600 tgtaagtggt ca 612 218 226 DNA Homo sapiens misc_feature (1)...(226) n = A,T,C or G 218 gcaaggcttc cctgtgcatc agnaccaaaa aggaggtgga aaagatgaat aagaagatgg 60 aggaagtaaa ggaagccaac atccgagttg tgtctgagga cttcctccag gacgtctccg 120 cctccaccaa gagccttcag gagttgttct tancgcacat cttgtcccct tggggggcag 180 aggtgaaggc agancctgtt gaagttgtgg ncccaagagg gaagnc 226 219 522 DNA Homo sapiens 219 gaattcggca cgagggctaa gcagctgcca gggggcagaa ttcacttctc tggttatgac 60 aatgaccgac caggcaattt ggtgtatcgc ttctgtgacg tcaaagacga gacctatgac 120 ttgctctacc agcaatgcga tgcccagcca ggggccagcg ggtctggggt ctatgtgagg 180 atgtggaaga gacagcagca gaagtgggag cgaaaaatag tttcttttcc aaaggttgtt 240 gctctacttt gtaggaagtc tttgcatatg gccctcccaa ctttaaagtc ataccagagt 300 ggccaagagt gtttatccca acccttccat ttaacaggat ttcactcaca tttctggaac 360 tagctatttt tcagaagaca ataatcaggg cttaattaga acaggctgta tttcctccca 420 gcaaacagtt gtggccacac taaaaacaat catagcattt tacccctgga ttatagcaca 480 tctcatgttt tatcatttgg atggagtaat ttaaaatgaa tt 522 220 331 DNA Homo sapiens 220 gcaatgagat tgagagtatc tatgatgaga ggcaagagca ggctacggag gagaatgctg 60 gtgtacctgt tggcccacac ctctcacttg cgtatgaaga caaacaaata ctggaagatg 120 ctgctgctct gattatccac catgtgaaga ggcagacagg cattcagaag gaggacaaat 180 ataagataaa acaaatcatg catcatttta ttccagattt gctctttgcc caaagaggtg 240 atctctcaga tgtggaggaa gaggaagaag aagagatgga tgtagatgaa gccacagggg 300 cagttaagaa gcacaatggt gttgggggca g 331 221 536 DNA Homo sapiens 221 ggcttgcagt gagccaagat cgtgccactg cactccagcc tgggcgacag agcaagactc 60 ycgtctcaaaa aataaaaaaa aaaaaaaaat aggtgaaaat tccttataaa tccaggattg 120 gctctgagag aactggctaa gattcaggaa gaaacaaaaa attcagaatc ctacaaggtt 180 yttgatgacaa ttagggccaa aattttagga ggagatgtag gatgcaggag aaaattaaag 240 tgttttcttt atatcagagg aggaaatagt agaggtcagt gaaggtctgg ggtagggaaa 300 cattcagact gtccattgca tggctgtgga gtgagactgc ccttagcctg ggtcagcctt 360 cctgggccat aaattgggca tccgtgatgc taggtaactg tgggaacaaa atgacagctt 420 agagcagcca tgggtgatgt ttggtggtaa aaaacctaca ggcgtttggg gtcccatgat 480 tgttccagac catgactctt cctggttgtg ggtttgttac agagcaggag aagcag 536 222 576 DNA Homo sapiens 222 tttagttgag tctgactcac ttgctttggt tcctgtgtat tttactaccc ctcttgtcag 60 tgaccttcct tccccacccc acccagagtg aatttgtagc atgattgtat aaacctctat 120 gtagaaaatg gagatttctt gctctgaaat gttaagctct aactgatcca tttctgtgtc 180 ctttagccta gtatgtctga acttccattc ttgttatata tttaaacttt ccctctatat 240 tataggtttt gtggcatcca cggtcaggtg tagaggaagc tgccccttgc agaactgtac 300 tgtaatattt ttcttttata aatattttca caggactgat tgtacacagg gcttgtaata 360 aaattttaac actgtgctgt gaaacaacta tggggaatct ccattgaagg ctacttcatg 420 ggcacctgaa agtggagtgt tatagctatg actttctatt tcttgtttcc taagtaaatt 480 aaacctaatt ttcacccttt cattctgttt cagcctcctg tataagaagt accgtatttt 540 ctgcccatca tactttgtaa taaaacttga acatgt 576 223 159 DNA Homo sapiens misc_feature (1)...(159) n = A,T,C or G 223 gcggagcata atcncccgag cagnggtgac aggaacctgg agcgaagatg cagccccaac 60 ctctcccgag aggtgctcta cgaaatcttt cgctccctac acaccctggt tggacagctt 120 gacctnaaan atgatgtggn gaaaatgana anctgttgt 159 224 312 DNA Homo sapiens misc_feature (1)...(312) n = A,T,C or G 224 gggttgctca agaaacagat gtgagagccc agattcgtct gcaatttcgt gatgtcaatg 60 gagaacttat agctgtgcaa agatctatgg tgtgtactca gaaaagcaaa aagacagaat 120 ttaaaactct ggaaggagtc attactagaa caaagcatgg tgaaaaggtc agtctgagct 180 ctaagtgtgc agaaattgac cgagaaatga tcagttctct tggggtttcc aaggctgtgn 240 taaatantgt natttnctgt catcnnagaa nattctnntt ggcctttang tgganggaaa 300 ngctttgang gt 312 225 563 DNA Homo sapiens misc_feature (1)...(563) n = A,T,C or G 225 gcccagctct ccccagtcct gtctaggtcc ccgtcggggg acagaccagc agtaggacac 60 agtggcttga agcccctctc tggggagacc ccgctctcgg cttacaaagt gaagagccgc 120 accaagatca tccggagacg cggcagcaca agccttcctg gagacaagaa aagcggcacc 180 tcacctgccg ccaccgccaa gagccacctc agcctccggc ggagacaggc cctcaggggg 240 aagagcagcc ctgtcctgaa gaagaccccc aacaagggcc tggtacaggt caccacgcac 300 cgactatgtc gcctgccacc gagccgggcc cacctcccca ccaaggaagc gtccagcctg 360 catgccgtgc ggactgcacc caccagcaag gtgatcaaga cccgctaccg cattgtcaag 420 aagacgccgg cctcgcctct cagcgccccg cccttccccc tgtctcttgc cctcctggcg 480 ggcccggcgg ctcttactat ccangtcccc tnggtgcttg aacccgcctg gcgtccaagt 540 ttgccaaccg ggggggtngg gga 563 226 564 DNA Homo sapiens 226 gcccaaaccc actccacctt actaccagac aaccttagcc aaaccattta cccaaataaa 60 gtataggcga tagaaattga aacctggcgc aatagatata gtaccgcaag ggaaagatga 120 aaaattataa ccaagcataa tatagcaagg actaacccct ataccttctg cataatgaat 180 taactagaaa taactttgca aggagagcca aagctaagac ccccgaaacc agacgagcta 240 cctaagaaca gctaaaagag cacacccgtc tatgtagcaa aatagtggga agatttatag 300 gtagaggcga caaacctacc gagcctggtg atagctggtt gtccaagata gaatcttagt 360 tcaactttaa atttgcccac agaaccctct aaatcccctt gtaaatttaa ctgttagtcc 420 aaagaggaac agctctttgg acactaggaa aaaaccttgt agagagagta aaaaatttaa 480 cacccatagt aggcctaaaa gcagccacca attaagaaag cgttcaagct cacacccact 540 acctaaaaaa tcccaaacat ataa 564 227 563 DNA Homo sapiens misc_feature (1)...(563) n = A,T,C or G 227 gatcatccac gacccgggcc gcggcgcncc cctcgccaag gtggtcttcc gggatccgta 60 tcggtttaag aagcggacgg agctgttcat tgccgccgag ggcattcaca cgggccagtt 120 tgtgtactgc ggcaagaagg cccagctcaa cattggcaat gtgctccctg tgggcaccat 180 gcctgagggt acaatcgtgt gctgcctgga ggagaagcct ggagaccgtg gcaagctggc 240 ccgggcatca gggaactatg ccaccgttat ctcccacaac cctgagacca agaagacccg 300 tgtgaagctg ccctccggct ccaagaaggt tatctcctca gccaacagag ctgtggttgg 360 tgtggtggct ggaggtggcc gaattgacaa acccatcttg aaggctggcc gggcgtacca 420 caaatataag gcaaagagga actgctggcc acgagtacgg ggtgtggcca tgaatccttg 480 tggagcatcc ttttggaggt ggcaaccacc agcacatcgg naagccctcc accatccgca 540 gagatgcccc tgctggccgc aaa 563 228 564 DNA Homo sapiens misc_feature (1)...(564) n = A,T,C or G 228 gcctcgtgcc gaaacagaca tgtttaatga tagcttgctc ttcacagaga tgtctacaga 60 gacttttaat ctataatcca ggagtatata accatgcagc acagaccaat tagccaaatg 120 caaaataaac tagattctta ccacaactat cctataaaca ctgcaactat tctttccaaa 180 aggaccctaa tcttatgtga aaacacctac tgtggggaga atcccggaca gccctgctcc 240 ctgcagccag gtgtagtttc gggagccact ggggccaaag tgagagtcca gcggtcttcc 300 agcgcttggg ccacggcggc ggccctggga gcagaggtgg agcgacccca ttacgctaaa 360 gatgaaaggc tggggttggc tggccctgct tctgggggcc ctgctgggaa ccgcctgggc 420 tcggaggagc caggatctcc actgtggagg gctctggtgg atgaactaga atgggaaatt 480 gccangtgga ccccaagaag accattcaga agggatcttt ccngatcaaa tccagatggc 540 anccagtcag tggtggangn gcct 564 229 563 DNA Homo sapiens 229 gcaacttcgg atttcactct acccggagag tttcccgctt ggttgaacac attggcctca 60 ggaagcttcc ttcaaaatgt ctactgttca cgaaatcctg tgcaagctca gcttggaggg 120 tgatcactct acacccccaa gtgcatatgg gtctgtcaaa gcctatacta actttgatgc 180 tgagcgggat gctttgaaca ttgaaacagc catcaagacc aaaggtgtgg atgaggtcac 240 cattgtcaac attttgacca accgcagcaa tgcacagaga caggatattg ccttcgccta 300 ccagagaagg accaaaaagg aacttgcatc agcactgaag tcagccttat ctggccacct 360 ggagacggtg attttgggcc tattgaagac acctgctcag tatgacgctt ctgagctaaa 420 agcttccatg aaggggctgg gaaccgacga ggactctctc attgagatca tctgctccag 480 aaccaaccag gagctgcagg aaattaacag agtctacaag gaaatgtaca agactgatct 540 ggagaaggac attatttcgg aca 563 230 563 DNA Homo sapiens 230 gcaaacctgt gcttatggct cttgcggagg gtcctggtgc tgagggtccc cgcctggcct 60 cacctagtgg cagcaccagc tcagggctgg aggtggtggc tcctgagggt acctcagccc 120 caggtggtgg cccgggaacc ctggatgaca gtgccaccat ttgccgtgtc tgccagaagc 180 caggcgatct ggttatgtgc aaccagtgtg agttttgttt ccacctggac tgtcacctgc 240 cggccctgca ggatgtacca ggggaggagt ggagctgctc actctgccat gtgctccctg 300 acctgaagga ggaggatggc agcctcagcc tggatggtgc agacagcact ggcgtggtgg 360 ccaagctctc accagccaac cagcggaaat gtgagcgtgt actgctggcc ctattctgtc 420 acgaaccctg ccgccccctg catcagctgg ctaccgactc caccttctcc ctggaccagc 480 ccggtggcac cctggatctg accctgatcc gtgcccgcct ccaggagaag ttgtcacctc 540 cctacagctc cccacaggag ttt 563 231 563 DNA Homo sapiens misc_feature (1)...(563) n = A,T,C or G 231 gcatgatctc ggctcactgt aagctcctnc ttctgggttc acaccattct cctgcctcag 60 cttcccgggt ggctgggact acaggcgtcc gccaccacct caggctattt tgtatttttg 120 gtagagacgg ggtttcaccg tgttagctag gatggtctcg atctcctgac ctcgtgatcc 180 gcccacctcg gcctcccaaa gtgctgggac tacaggcatg agccaccacg cccggccaaa 240 catgggtgaa tttcttttaa aacctgaaaa taaagaattt tttttctctg atgcttcttg 300 aagtgtacat gtttatcttt tagtctggca aactcacttc taagaattaa cctcagagat 360 aagctgtcaa aaatacaaaa tggcagagat taagaacaat gattctcaga ctcgagcttg 420 cgtcagaatc acctacagga cgtatggtgc agattcctgg gcctcacttc agagtatctg 480 attcagtaga tctgggcttc cacccaataa ttttcatttc taatagattt ttaggtagta 540 atgatgttgt tctgggtacc aca 563 232 563 DNA Homo sapiens 232 gcagcggcgg ctgggagagc ggtcggcggg gtttcttcgt tgcattgcct gagaggagcg 60 gagtctgcca ggtggtgtcc atcatgttct ctttcaacat gttcgaccac cctattccca 120 gggtcttcca aaaccgcttc tccacacagt accgctgctt ctctgtgtcc atgctagcag 180 ggcctaatga caggtcagat gtggagaaag gagggaagat aattatgcca ccctcggccc 240 tggaccaact cagccgactt aacattacct atcccatgct gttcaaactg accaataaga 300 attcggaccg catgacgcat tgtggcgtgc tggagtttgt ggctgatgag ggcatctgct 360 acctcccaca ctggatgatg cagaacttac tcttggaaga aggcggcctg gtccaggtgg 420 agagcgtcaa ccttcaagtg gccacctact ccaaattcca acctcagagc cctgacttcc 480 tggacatcac caaccccaaa gccgtattag aaaacgcact taggaacttt gcctgtctga 540 ccaccgggga tgtgattgcc atc 563 233 611 DNA Homo sapiens misc_feature (1)...(611) n = A,T,C or G 233 tttgaagaaa tgaacagaat aagaacatta gnngacnntg catacagctg tgatccaagg 60 ataaaaaagt tcaaggaaga agaaaaagcc aagaaagaag cagaaaagaa agcaaaagca 120 gaagctaaac ggaaggagca agaagctaaa gaaaaacaaa gacaagctga attagaagct 180 gctcggttag ctaaggagaa agaagaggag gaagtcagac agcaagcatt gctggcaaag 240 aaggaaaaag atatccagaa aaaagccatt aagaaggaaa ggcaaaaact tcgaaactca 300 tgcaagacct ggaatcattt ttctgataat gaggcagagc gggttaaaat gatggaagaa 360 gtggaaaaac tttgtgatcg gcttgaactg gcaagcttac agtgcttgaa tgaaacactc 420 acatcatgca caaaagaagt aggaaaggct gctttggaaa aacagataga agaaataaat 480 gagcaaatca gaaaagagaa agaggaagct gaggctcgta tgcgacaagc atctaagaac 540 acagagaaat caactggtgg aggtggaaat ggaagtaaaa attggtcaga agatgatcta 600 caattactaa t 611 234 564 DNA Homo sapiens 234 gagataaaga gaagaaagag aacaagaaaa tggctgatga ggatgccttg aggaagatcc 60 gggcagtgga ggagcagata gaatacctac agaagaagct agccatggcc aagcaggaag 120 aagaagcact cctctctgaa atggatgtca caggccaggc ctttgaagac atgcaggagc 180 aaaatatccg tttgatgcag caattgcggg agaaggatga tgcaaatttc aagctcatgt 240 cagagcgtat caagtccaat cagatccata agttgcttaa agaagagaag gaggagctgg 300 cagaccaggt gttgactctg aagactcagg ttgatgccca gctacaggta gtaaggaaac 360 tggaagagaa ggagcatctg ttacagagca acattggcac aggggagaaa gagctgggtc 420 ttaggaccca agccttagag atgaataaac gcaaggcaat ggaggcagcc cagcttgcag 480 atgacctcaa agcacaactg gagttggctc agaagaagct acatgatttt caggatgaga 540 tcgtggagaa cagtgttacc aaag 564 235 563 DNA Homo sapiens 235 gcttggatag agaatattta gccatttacc taaaaatggt attttttaca tgcaatgcat 60 gtggtgaatc agtgaagaaa atacaagtgg aaaagcatgt gtctgtttgc agaaactgtg 120 aatgcctttc ttgcattgac tgcggtaaag atttctgggg cgatgactat aaaaaccacg 180 tgaaatgcat aagtgaagat cagaagtatg gtggcaaagg ctatgaaggt aaaacccaca 240 aaggtgacat caaacagcag gcgtggattc agaaaattag tgaattaata aagagaccca 300 atgtcagccc caaagtgaga gaacttttag agcaaattag tgcttttgac aacgttccca 360 ggaaaaaggc aaaatttcag aattggatga agaacagttt aaaagttcat aatgaatcca 420 ttctggacca ggtgtggaat atcttttctg aagcttccaa cagcgaacca gtcaataagg 480 aacaggatca acggccactc ccccagtggc aaatccacat gcagaaatct ccaccaaggt 540 tccagcctcc aaagtgaaag acc 563 236 563 DNA Homo sapiens 236 gctcttccgc cctgctgcag gcggaggtgc tggatctgga cgaggacgag gacgacctgg 60 aggtgttcag caaggatgcc tcattgatgg acatgaactc cttcagccct atgatgccaa 120 catccccttt atcaatgata aaccaaatca agtttgagga tgaaccagat ttaaaggatc 180 tcttcatcac agttgatgaa cctgaaagtc atgttactac aatagaaact ttcattacgt 240 ataggattat tactaagaca tctcgtgggg aatttgactc cagtgaattt gaagttagga 300 gacgatatca agatttcctt tggttgaagg gaaaactgga agaagcacac cccactctga 360 ttattccacc attgccagaa aagtttatag taaaaggaat ggtggaacgc tttaacgatg 420 acttcattga gacacgcagg aaggctttac ataaattttt gaaccgaatt gctgatcatc 480 caactttaac atttaatgaa gacttcaaaa tttttctcac tgcacaagct tgggaactct 540 cttctcacaa gaagcaaggt cct 563 237 493 DNA Homo sapiens misc_feature (1)...(493) n = A,T,C or G 237 ggcgatttgt ctggttaatt ccgataacga acgagactct ggcatgctaa ctagttacgc 60 tgccggcaca gtcttcacta ccgtagaaga ccttggctcc aagatactcc tcacctgctc 120 cttgaatgac agcgccacag aggtcacagg gcaccgctgg ctgaaggggg gcgtggtgct 180 gaaggaggac gcgctgcccg gccagaaaac ggagttcaag gtggactccg acgaccagtg 240 gggagagtac tcctgcgtct tcctccccga gcccatgggc acggccaaca tccagctcca 300 cgggcctccc agagtgaagg ctgtgaagtc gtcagaacac atcaacgagg gggagacggc 360 catgctggtc tgcaagtcag agtccgtgcc acctgtcact gactgggcct ggtacaagat 420 cactgactct gaggacaang ccctcatgaa cggctccgan agcaggntct tcntgagttc 480 ctcgnagggc cgg 493 238 503 DNA Homo sapiens misc_feature (1)...(503) n = A,T,C or G 238 gggaatgccg gcacggtggg gtctntgctc ccgcagcagt ggccacttcg cctcctggtg 60 caatccctaa ggaagcctgc ggaggagcac ccctgcaggg tctgcctggc gaagccctgg 120 gctgccctgc gggtgtgggc acccccgtgc cagcagatgg cactcagacc cttacctgtg 180 cacacacctc tgctcctgag agcacagccc caaccaacca cctggtggct ggcagggcca 240 tgaccctgag tcctcaggaa gaagtggctg caggccaaat ggccagctcc tcgaggagcg 300 gacctgtaaa actagaattt gatgtatctg atggcgccac cagcaaaagg gcacccccac 360 caaggagact gggagagagg tccggcctca agcctccctt gaggaaagca gcagtgaggc 420 agcaaaaggc cccgcaggag gtggaggagg acgacggtag gagcggagca ggagaggacc 480 cccccatgcc agcttctcgg ggc 503 239 502 DNA Homo sapiens misc_feature (1)...(502) n = A,T,C or G 239 ggaaccactc ggcgccgcct ggtgcatggg aggggagccg ggccaggaac aatatgttag 60 ccgtgcactt tgacaagccg ggaggaccgg aaaacctcta cgtgaaggag gtggccaagc 120 cgagcccggg ggagggtgaa gtcctcctga aggtggcggc cagcgccctg aaccgggcgg 180 acttaatgca gagacaaggc cagtatgacc cacctccagg agccagcaac attttgggac 240 ttgaggcatc tggacatgtg gcagagctgg ggcctggctg ccagggacac tggaagatcg 300 gggacacagc catggctctg ctccccggtg ggggccaggc tcagtacgtc actgtccccg 360 aagggctcct catgcctatc ccanagggat tgaccctgac ccangctgca gccatcccag 420 aggcctggct caccgccttc cagctgttac atcttgtggg aaatgttcan gctggagact 480 atgtgctaat ccatgcagga ct 502 240 822 DNA Homo sapiens misc_feature (1)...(822) n = A,T,C or G 240 ggcaggctgg ctgggtaccg gctgtcgctg acccaggaga agctgcctgt ctacatcagc 60 ctgggctgca gcgcgctgcc gccgcggggc cggcagccat ggccaaggac atcctgggtg 120 aagcagggct acactttgat gaactgaaca agctgagggt gttggaccca gaggttaccc 180 agcagaccat agagctgaag gaagagtgca aagactttgt ggacaaaatt ggccagtttc 240 agaaaatagt tggtggttta attgagcttg ttgatcaact tgcaaaagaa gcagaaaatg 300 aaaagatgaa ggccatcggt gctcggaact tgctcaaatc tatagcaaag cagagagaag 360 ctcaacagca gcaacttcaa gccctaatag cagaaaagaa aatgcagcta gaaaggtatc 420 gggttgaata tgaagctttg tgtaaagtag aagcagaaca aaatgaattt attgaccaat 480 ttatttttca gaaatgaact gaaaatttcg cttttatagt aggaaggcaa aacaaaaaaa 540 agcctctcaa aaccaaaaaa acctctgtag cattccagcg gcttgaccaa tgacctatgt 600 cacaagaggt ggctgtaagg aatgcagccc cctgaagaca gcactacaag tctgggggag 660 ccagttttaa catcagtgca cagctgctgc tggtggccct gcagtgtacg ttctcncctc 720 ttatgcttag ttggaactaa gcagtttgta aactttcatc cttttttttg taaattcnca 780 aagctttgga aggagaagca ataaattttt gttttcaaat gg 822 241 614 DNA Homo sapiens 241 gctttagttc tttgcaagaa ggtagagata aagacacttt ttcaaaaatg gcaatggtat 60 cagaattcct caagcaggcc tggtttattg aaaatgaaga gcaggaatat gttcaaactg 120 tgaagtcatc caaaggtggt cccggatcag cggtgagccc ctatcctacc ttcaatccat 180 cctcggatgt cgctgccttg cataaggcca taatggttaa aggtgtggat gaagcaacca 240 tcattgacat tctaactaag cgaaacaatg cacagcgtca acagatcaaa gcagcatatc 300 tccaggaaac aggaaagccc ctggatgaaa cactgaagaa agcccttaca ggtcaccttg 360 aggaggttgt tttagctctg ctaaaaactc cagcgcaatt tgatgctgat gaacttcgtg 420 ctgccatgaa gggccttgga actgatgaag atactctaat tgagattttg gcatcaagaa 480 ctaacaaaga aatcagagac attaacaggg tctacagaga ggaactgaag agagatctgg 540 ccaaagacat aacctcagac catctggaga tttcggaacg ctttgctttc tcttgctaaa 600 ggtgaccgat ctga 614 242 607 DNA Homo sapiens misc_feature (1)...(607) n = A,T,C or G 242 gctgcgatgg ctgcctgcga gggcaggaga agntgagctc tcggttcctc tcagtcggac 60 ttcctgacgc cgccagtggg cggggcccct tgggccgtcg ccaccactgt agtcatgtac 120 ccaccgccgc cgccgccgcc tcatcgggac ttcatctcgg tgacgctgag ctttggcgag 180 agctatgaca acagcaagag ttggcggcgg cgctcgtgct ggaggaaatg gaagcaactg 240 tcgagattgc agcggaatat gattctcttc ctccttgcct ttctgctttt ctgtggactc 300 ctcttctaca tcaacttggc tgaccattgg aaagctctgg ctttcaggct agaggaagag 360 cagaagatga ggccagaaat tgctgggtta aaaccagcaa atccacccgt cttaccagct 420 cctcagaagg cggacaccga ccctgagaac ttacctgaga tttcgtcaca gaagacacaa 480 agacacatcc agcggggacc acctcacctg cagattagac ccccaagcca agacctgaag 540 gatgggaccc aggaggaggc cacaaaaagg caaagaagcc cctgtggatc cccgcccgga 600 aggagat 607 243 609 DNA Homo sapiens misc_feature (1)...(609) n = A,T,C or G 243 taaaagtaga aggggcccag aaccagggca agaagnntga gggtgcccag aaccagggca 60 aaaaggccga gggggcccag aatcagggca aaaaggccga gggggcccag aaccagggca 120 agaaggcaga gggggcccag aaccagggca agaaggccga gggggcccag aaccaggaca 180 agaaggccga gggggcccag aaccagggca ggaaggccga gggggcccag aaccagggca 240 agaaggccga gggggcccag aaccagggca agaaggccga gggggccccg aaccagggca 300 agaaggccga gggggccccg aaccagggca agaaggccga gggggccccg aaccagggca 360 agaaggccga ggggaccccg aaccagggca agaaggccga ggggaccccn aaccagggca 420 agaaggccga ggggaccccg aaccaggcaa gaaggccnag gggaccccga accaggcaag 480 aaggccgagg gggcccagaa ccangcaaaa aggccgaggg gcccagaacc anggcaagaa 540 ggcnagggga ccccgaacca anggcaanaa ggccnaaggg nnccccgaac canggcaaaa 600 aaggcntaa 609 244 609 DNA Homo sapiens misc_feature (1)...(609) n = A,T,C or G 244 gggtttgggg aaacggccgc tgagtgaggc gtcngctgtg tttctcaccg cggtcttttc 60 ctcccactct tggctggttg gaccccgcta tggaaaagtt ggcccctgag ccagagctcc 120 agcagccttg ttagggcgtg gcctgaggct tggataagtg ggatgtaaaa cgaagatcag 180 gagcagattt gaagaattac aaagtgaatt ggtgccagtc agcatgtcag agacagacca 240 catagcctct acttcctctg ataaaaatgt tgggaaaaca cctgaattaa aggaagactc 300 atgcaacttg ttttctggca atgaaagcag caaattagaa aatgagtcca aactattgtc 360 attaaacact gataaaactt tatgtcaacc taatgagcat aataatcgaa ttgaagccca 420 ggaaaattat attccagatc atggtggagg tgaggattct tgtgccaaaa cagacacagg 480 ctcagaaaat tctgaacaaa tagctaattt tcctagtgga aattttgcta aacatatttc 540 aaaaacaaat gaaacagaac agaaagtaac acaaatattg gtggaattaa ggncatctac 600 atttccaga 609 245 641 DNA Homo sapiens 245 ggggaactcc ttgctgctgt gaggaaggcc caggccaatg tgatgctctt cttagaggag 60 aaggagcaag ctgcgctgag ccaggccaac ggtatcaagg cccacctgga gtacaggagt 120 gccgagatgg aggagtactg caagtttaag aacactgaag acatcacctt ccctagtgtt 180 tacgtagggc tgaaggataa actctcgggc atccgcaaag ttatcacgga atccactgta 240 cacttaatcc agttgctgga gaactataag aaaaagctcc aggagttttt ccaaggaaga 300 ggagtatgac atcagaactc aagtgtctgc cgttgttcag cgcaaatatt ggacttccaa 360 acctgagccc agcaccaggg aacagttcct ccaatatgcg tatgacatca cgtttgaccc 420 ggacacagca cacaagtatc tccggctgca ggaggagaac cgcaaggtca ccaacactac 480 gccctgggag catccctacc cggacctccc cagcaggttc ctgcactggc ggcaggtgct 540 gtcccagcag agtctgtacc tgcacaggta ctattttgag gtggagatct tcggggcagg 600 cacctatgtt ggcctgacct gcaaaggcat cgacccggaa a 641 246 370 DNA Homo sapiens misc_feature (1)...(370) n = A,T,C or G 246 ccatgttagg ttaaaccaaa atgnccatct cacaaaaacn agtttagttg aatgctcaat 60 tccttacaaa atcttcattt aaggctgagt gtggtggctc atgcctgtaa ttccagcact 120 ttgggaggcc gaggcaggaa gactgcttga gcccaggagt tcaagaccag cctgggcaac 180 ataggaagac tccgtctcta catagaaatt ttattataaa agtagctggg gatggtggca 240 cacacctgta gtcccagcta ctcaggaggt tgaggtggga agattgcttg agcccaggag 300 tttgaggttg cagtgagctg atcatnccac tgcactytag cytaggctac agagcagsac 360 cccatttccm 370 247 504 DNA Homo sapiens misc_feature (1)...(504) n = A,T,C or G 247 gagagagaga gaagaaaaaa aattaaagga agttntggat agccttaaac aggaaacaca 60 agggcttcag aaagaaaaag aaagtcgaga gaaagaactt atgggtttca gcaaatcggt 120 aaatgaagca cgttcaaaga tggatgtagc ccagtcagaa cttgatatct atctcagtcg 180 tcataatact gcagtgtctc aattaactaa ggctaaggaa gctctaattg cagcttctga 240 gactctcaaa gaaaggaaag ctgcaatcag agatatagaa ggaaaactcc ctcaaactga 300 acaagaatta aaggagaaag aaaaagaact tcaaaaactg acacaagaag aaacaaactt 360 taaaagtttg gttcatgatc tctttcaaaa agttgaagaa gcaaagagct cattagcaat 420 gaatcgaagt agggggaaag tccttgatgc aataattcaa gaaaaaaaat ctggcaggat 480 tccaggaata tatggaagat tggg 504 248 502 DNA Homo sapiens misc_feature (1)...(502) n = A,T,C or G 248 ggccnggaga agcgngacan ccgcaacatg gaagtacagg tcacccagga gatgcgcaac 60 gtcagtatag gcatgggcag cagtgacgag tggtctgatg ttcaagacat tattgactcc 120 acgccagagc tggacatgtg tccagagacc cgcctggacc gcacaggaag caggtactgg 180 ctcagcccag gccctggggt cctgggggct cagtatttat ccaagtcggt gtctttggtg 240 aagtcttcct ctgccaccac tgggagaacc atcaaggctg tggcctaccg cctccatcca 300 cgaccggcct gcagggcagt gtgggcattg aatggccacg gtgggaggtg ttggaagagt 360 ctgcagacgc agccactccg ggctgcccac tgactctgct ctctctcccg acctgtggat 420 cccaacagcc caacccaggg catcgtgaac aaagctttcg gcatcaacac cgactccctg 480 taccatgagc tgtcgacggc ag 502 249 503 DNA Homo sapiens 249 ggtccatacg gcgttgttct ggattcccgt cgtaacttaa agggaaactt tcacaatgtc 60 cggagccctt gatgtcctgc aaatgaagga ggaggatgtc cttaagttcc ttgcagcagg 120 aacccactta ggtggcacca atcttgactt ccagatggaa cagtacatct ataaaaggaa 180 aagtgatggc atctatatca taaatctcaa gaggacctgg gagaagcttc tgctggcagc 240 tcgtgcaatt gttgccattg aaaaccctgc tgatgtcagt gttatatcct ccaggaatac 300 tggccagagg gctgtgctga agtttgctgc tgccactgga gccactccaa ttgctggccg 360 cttcactcct ggaaccttca ctaaccagat ccaggcagcc ttccgggagc cacggcttct 420 tgtggttact gaccccaggg ctgaccacca gcctctcacg gaggcatctt atgttaacct 480 acctaccatt gcgctgtgaa cac 503 250 1430 DNA Homo sapiens 250 aggtgagaga ggatgtgtgc tgggccttgg aggaaggggg ccgagaccgg gccttacttc 60 tgtaacgata ctgtgaggca tcggaaggcc agcctgttgt gtccgttttg aaggatgccc 120 ctgtcccgct ggttgagatc tgtgggggtc ttcctgctgc cagcccccta ctgggcaccc 180 cgggagaggt ggctgggttc cctacggcgg ccctccctgg tgcacgggta cccagtcctg 240 gcctggcaca gtgcccgctg ctggtgccaa gcgtggacag aggaacctcg agccctttgc 300 tcctccctca gaatgaacgg agaccagaat tcagatgttt atgcccaaga aaagcaggat 360 ttcgttcagc acttctccca gatcgttagg gtgctgactg aggatgagat ggggcaccca 420 gagataggag atgctattgc ccggctcaag gaggtcctgg agtacaatgc cattggaggc 480 aagtataacc ggggtttgac ggtggtagta gcattccggg agctggtgga gccaaggaaa 540 caggatgctg atagtctcca gcgggcctgg actgtgggct ggtgtgtgga actgctgcaa 600 gctttcttcc tggtggcaga tgacatcatg gattcatccc ttacccgccg gggacagatc 660 tgctggtatc agaagccggg cgtgggtttg gatgccatca atgatgctaa cctcctggaa 720 gcatgtatct accgcctgct gaagctctat tgccgggagc agccctatta cctgaacctg 780 atcgagctct tcctgcagag ttcctatcag actgagattg ggcagaccct ggacctcctc 840 acagcccccc agggcaatgt ggatcttgtc agattcactg aaaagaggta caaatctatt 900 gtcaagtaca agacagcttt ctactccttc taccttccta tagctgcagc catgtacatg 960 gcaggaattg atggcgagaa ggagcacgcc aatgccaaga agatcctgct ggagatgggg 1020 gagttctttc agattcagga tgattacctt gacctctttg gggaccccag tgtgaccggc 1080 aaaattggca ctgacatcca ggacaacaaa tgcagctggc tggtggttca gtgtctgcaa 1140 cgggccactc cagaacagta ccagatcctg aaggaaaatt acgggcagaa ggaggctgag 1200 aaagtggccc gggtgaaggc gctatatgag gagctggatc tgccagcagt gttcttgcaa 1260 tatgaggaag acagttacag ccacattatg gctctcattg aacagtacgc agcacccctg 1320 cccccagccg tctttctggg gcttgcgcgc aaaatctaca agcggagaaa gtgacctaga 1380 gattgcaagg gcggggagag gaggctctca ataaataatc gtgtaacctt 1430 251 424 DNA Homo sapiens 251 gggccgcccc gccagggacc caccaatgag gcacagatgg cagccgctgc cgccctagcc 60 cggctggagc agaagcagtc ccgggcctgg ggccccacat cgcaggacac catccgaaac 120 caggtgagaa aggaacttca agccgaagcc accgtcagcg ggagccccga ggccccaggg 180 accaacgtgg tatctgagcc cagagaggaa ggctctgccc acctggctgt gcctggcgtg 240 tacttcacct gtccgctcac tggggccacc ctgaggaagg accagcggga cgcctgcatc 300 aaggaggcca ttctcttgca cttctccacc gacccagtgg ccgcctccat catgaagatc 360 tacacgttca acaaagacca ggaccgggtg aagctgggtg tggacaccat tgccaagtac 420 ctgg 424 252 423 DNA Homo sapiens misc_feature (1)...(423) n = A,T,C or G 252 gagaggctga ccgtntggct gcactgcggg ctaaggagga gctggagaga caggcggtgg 60 atcatgataa agagccanga gcanctggct gcggagcttg cagaatacac tgccaagatt 120 gccctcctgg aagaggcgcg gaggcgcaan gaggatgaag ttgaagagtg gcagcncagg 180 gccaaagaag cccangatga cctgttgaag accaaggagg agctgcacct ggtgatgaca 240 gcacccccgc ccccaccacc ccccgtgtac gagccggtga gctaccatgt ccaggagagc 300 ttgcaggatg agggcgcaga gcccacgggc tacagcgcgg agctgtctat tgagggcatc 360 cgggatgacc gcaatgagga gaagcgcatc actgaggcag agaagaacga gcgtgtgcag 420 cgg 423 253 423 DNA Homo sapiens misc_feature (1)...(423) n = A,T,C or G 253 gcatatgccc tgtngnttct tncacctatg tgccnagccg gcngggggaa gatgccnaaa 60 tggcgaccgg caactacttt ggattcaccc acagcggggc ggcggcggcn gcngctgcgg 120 cccaatatag ncagcagnca gcttcgggtg tagcctattc tcatccaact acagttgcta 180 gctacactgt ccatcaggct ccagnagctg ctcacacagt tactgctgcc tatgcaccag 240 canccgncac anttgcagtt gccaggnctg ctccancagn tgatgcagna gctgcaacan 300 ctgctgctta tggaggctac cccactgcnc acacaggcaa ctgactatgg ttatactcnn 360 aggcaacaag aagcaccacc accaccacgc cncagctact acacaaaact accaggattc 420 ata 423 254 423 DNA Homo sapiens 254 gcagatcttt cctgaaatca tgccactttg gccccaagaa ccactactgc cccatcttcc 60 gactgggctc cgtgatccgc tgggccggga gcgacttcca ggatatagcc ctggagggtg 120 gcgtgatagg aattaatatt gaatggaact gtgatcttga taaagctgcc tctgagtgcc 180 accctcacta ttcttttagc cgtctggaca ataaactttc aaagtctgtc tcctccgggt 240 acaacttcag atttgccaga tattaccgag acgcagccgg ggtggagttc cgcaccctga 300 tgaaagccta cgggatccgc tttgacgtga tggtgaacgg caaggcaggg aagttcagca 360 tcattcccac catcatcaac gtgggctctg gggtggcgct catgggtgct ggtgctttct 420 tct 423 255 424 DNA Homo sapiens misc_feature (1)...(424) n = A,T,C or G 255 ggggagacag gctgagccgn ctgggcggtc tggnctgtac ggggcggggg aggccatggc 60 ctcggctgag ttgcagggga agtaccacca agctggctca ggagtactcn aagcttcggg 120 ctcagaatca ngttctgaaa aaaggtgttn nggatgaaca ancaaattct ncagctntaa 180 agctaaactg ngaggaccaa tctgttttca ctttgatata ngagcaactg aaaatgaang 240 atcagtcatt gagaaaacta caacaggaaa tggacagttt gacatttcga aatctgcagc 300 ttgccaagag ggnacaacta cttcaagatg aactagctct aactgaacca cgaggcnana 360 aaaacaagaa aagtggagaa tcttcttctc agttgagnca agagcacaaa gagtgtcttt 420 gatg 424 256 423 DNA Homo sapiens 256 ggagcgcagc agccatggca agccgtctcc tgctcaacaa cggcgccaag atgcccatcc 60 tggggttggg tacctggaag tcccctccag ggcaggtgac tgaggccgtg aaggtggcca 120 ttgacgtcgg gtaccgccac atcgactgtg cccatgtgta ccagaatgag aatgaggtgg 180 gggtggccat tcaggagaag ctcagggagc aggtggtgaa gcgtgaggag ctcttcatcg 240 tcagcaagct gtggtgcacg taccatgaga agggcctggt gaaaggagcc tgccagaaga 300 cactcagcga cctgaagctg gactacctgg acctctacct tattcactgg ccgactggct 360 ttaagcctgg gaaggaattt ttcccattgg atgagtcggg caatgtggtt cccagtgaca 420 cca 423 257 363 DNA Homo sapiens misc_feature (1)...(363) n = A,T,C or G 257 gggacaaggc tccagggcag ctggagtgtg aaacggccat tgcagctctg aacagttgtc 60 tacggnacct anaccaggnt tccttnnttn nntnnnnccn cnacttgntn cccnngaggg 120 aanntctcaa gaggccttgc acactcagat gctcactgca gtccaagaga tctcccatct 180 cattgagccg ctggccaatg ctgcccgggc tgaagcctcc cagctgggac acaaggtgtc 240 ccagatggcg cagtactttg agccgctcac cctggctgca gtgggtgctg cctccaagac 300 cctgagccac ccgcagcaga tggcactcct ggaccagact aaaacattgg cagagtctgc 360 cct 363 258 164 DNA Homo sapiens misc_feature (1)...(164) n = A,T,C or G 258 gccacggtgt cacctcggcc ccggacacca ggcnngcccc gggctccacc gcccccccag 60 cccacggtgt cacctcggcc ccggacanca aggcgggccc ggggttcanc ngccccncca 120 gnccacngtg tnacctcggc cccngacacc atgtcggacc cggg 164 259 600 DNA Homo sapiens misc_feature (1)...(600) n = A,T,C or G 259 ggcctgctgg ggcagggggc tacccagggg cttcctatcc tggggcctac cccgggcagg 60 cacccccagg ggcttatcct ggacaggcac ctccaggcgc ctaccctgga gcacctggag 120 cttatcccgg agcacctgca cctggagtct acccagggcc acccagcggc cctggggcct 180 acccatcttc tggacagcca agtgcccccg gagcctaccc tgccactggc ccctatggcg 240 cccctgctgg gccactgatt gtgccttata acctgccttt gcctggggga gtggtgcctc 300 gcatgctgat aacaattctg ggcacggtga ancccaatgc aaacagaatt gctttanatt 360 tccaaanang gaatgatgtt gccttccact ttaacccacg cttcaatgaa aacaacagga 420 gantcattgt ttgcaataca aagctggata ataactgggg aagggaanaa agacagtcng 480 nttttccatt tgaaaatggg aaaccattca aaatacaagt actggttgaa ccttaccact 540 ttaaaggttg caagtgaatg atgcttactt tgttgcaana caatcattng ggtaaaaaac 600 260 294 DNA Homo sapiens misc_feature (1)...(294) n = A,T,C or G 260 ggagctgctc tgctacgtac gaaaccccga cccagaagca ggtcgtctac gaatggttta 60 gcgccaggtt ccccacgaac gtgcggtgcg tgacgggcga gggggcggcc gcctttccgg 120 ccgcgccccg tttcccagga cgaagggcac tccgcaccgg accccggtcc cggcgcgcgg 180 cggggcacgc gccctcccgc gcgcgcgggg cgcgtggagg ggggggggcg gnccnncggn 240 ggggacaggc gggggaccgg ntntccnang ccaaccgagg ctccgnggng ctgc 294 261 601 DNA Homo sapiens 261 ggcggttcgt ggttgccttg gtcctcctga acgtcgcagc ggcgggagcc gtgccgctct 60 tggccaccga aagcgtcaag caagaagaag ctggagtacg gccttctgca ggaaacgtct 120 ccacccaccc cagcttgagc caacggcctg gaggctctac caagtcgcat ccggagccgc 180 agactccaaa agacagccct agcaagtcga gtgcggaggc gcagacccca gaagacaccc 240 ccaacaagtc gggtgcggag gcaaagaccc aaaaagacag ctccaacaag tcgggtgcgg 300 aggcaaagac ccaaaaaggc agcactagca agtcgggttc ggaggcgcag accacaaaag 360 acagcactag taagtcgcat ccggagctgc agactccaaa agacagcact ggcaaatcgg 420 gtgcggaggc gcagacccca gaagacagcc ccaacaggtc gggtgcggag gcaaagaccc 480 aaaaagacag ccctagcaag tcaggttcgg aggcgcagac cacaaaagat gtccctaata 540 agtcgggtgc ggacggccag accccaaaag acggctccag caagtcgggt tgcggaggat 600 c 601 262 559 DNA Homo sapiens misc_feature (1)...(559) n = A,T,C or G 262 gctaaatcaa ggtgtacaga attggagaaa gaaattgaag aactcagatc aaaacctgtt 60 actgaaggaa ctggtgatat tattaaggca ttaactgaac gtctggatgc tcttcttctg 120 gaaaaagcag agactgagca acagtgtctt tctctgaaaa aggaaaatat aaaaatgaag 180 caagaggttg aggattctgt aacaaagatg ggagatgcac ataaggagtt ggaacaatca 240 catataaact atgtgaaaga aattgaaaat ttgaaaaatg agttgatggc agtacgttcc 300 aaatacagtg aagacaaatg ctaacttaca aaagcagctg gaagaagcaa tgaatacgca 360 attagaactt tcagaacaac ttaaatttca gaacaactct gaagataatg ttaaaaaact 420 acaagaagag attgagaaaa ttaggccacg ctttgaggag caaattttat atctgcaaaa 480 gcaattagac gctaccactg atgaaaanaa ggaaacagtt actcaactnc naantatcat 540 tgaggctaat tctcatcgt 559 263 602 DNA Homo sapiens misc_feature (1)...(602) n = A,T,C or G 263 gcggaggtgc gcagcaaatg cgaggagctg agtggcctcc acgggcagct ccaggaggcc 60 agggcggaga actcccagct cacagagaga atccgttcca ttgaggccct gctggaggcg 120 ggccaggcgc gggatgccca ggacgtccag gccagccagg cggaggctga ccagcagcag 180 actcgcctca aggagctgga gtcccaggtg tcgggtctgg agaaggaggc catcgagctc 240 agggaggccg tcgagcagca gaaagtgaag aacaatgacc tccgggagaa gaactggaag 300 gccatggagg cactggccac ggccgagcag gcctgcaagg agaagctgca ctccctgacc 360 caggccaagg aggaatcgga gaagcagctc tgtctgattg aggcgcagac catggaggcc 420 ctgctggctc tgctcccaga actctctgtc ttggcacaac agaattacac cgagtggctg 480 caggatctca aagagaaagg ccccacgctg ctgaagcacc cgccagctcc ccgcggagcc 540 ctcctcggac ctggcctcca agttgaggga ggccgaggag acgcanagca cactgcaggc 600 cg 602 264 1232 DNA Homo sapiens 264 aggaaggtgg caagatggtg ttggaaagca ctatggtgtg tgtggacaac agtgagtata 60 tgcggaatgg agacttctta cccaccaggc tgcaggccca gcaggatgct gtcaacatag 120 tttgtcattc aaagacccgc agcaaccctg agaacaacgt gggccttatc acactggcta 180 atgactgtga agtgctgacc acactcaccc cagacactgg ccgtatcctg tccaagctac 240 atactgtcca acccaagggc aagatcacct tctgcacggg catccgcgtg gcccatctgg 300 ctctgaagca ccgacaaggc aagaatcaca agatgcgcat cattgccttt gtgggaagcc 360 cagtggagga caatgagaag gatctggtga aactggctaa acgcctcaag aaggagaaag 420 taaatgttga cattatcaat tttggggaag aggaggtgaa cacagaaaag ctgacagcct 480 ttgtaaacac gttgaatggc aaagatggaa ccggttctca tctggtgaca gtgcctcctg 540 ggcccagttt ggctgatgct ctcatcagtt ctccgatttt ggctggtgaa ggtggtgcca 600 tgctgggtct tggtgccagt gactttgaat ttggagtaga tcccagtgct gatcctgagc 660 tggccttggc ccttcgtgta tctatggaag agcagcggca gcggcaggag gaggaggccc 720 ggcgggcagc tgcagcttct gctgctgagg ccgggattgc tacgactggg actgaaggtg 780 aaagaggtgg aatccgaagt cctgggactg cgggatgcta aacattgaaa gctgggtgta 840 ggcactgcag ggagagtgtg gaggtctgac agggtaggaa tatgtgggag ggctgggcta 900 ggaatggcct tggaggctgg cctgtgtgga tatggcacca attctaccct gctccccttt 960 tccttttccc agactcagac gatgccctgc tgaagatgac catcagccag caagagtttg 1020 gccgcactgg gcttcctgac ctaagcagta tgactgagga agagcagatt gcttatgcca 1080 tgcagatgtc cctgcaggga gcagagtttg gccaggcgga atcagcagac attgatgcca 1140 gctcagctat ggacacatcc gagccagcca aggaggagga tgattacgac gtgatgcagg 1200 accccgagtt ccttcagagt gtcctagaga ac 1232 265 2766 DNA Homo sapiens 265 cgattggtgt tggcggtctg gctcagctgg gcagggggta actttactga tttgggggtg 60 gtttttagtt taatttttct tttctagctt cccatcgacg gtcagtgcgc acgttgtaat 120 cagctgaggc catgtcagga gacggagcca cggagcaggc agctgagtat gtcccagaga 180 aggtgaagaa agcggaaaag aaattagaag agaatccata tgaccttgat gcttggagca 240 ttctcattcg agaggcacag aatcaaccta tagacaaagc acggaagact tatgaacgcc 300 ttgttgccca gttccccagt tctggcagat tctggaaact gtacattgaa gcagagatta 360 aagctaaaaa ttatgacaag gttgaaaagc tatttcagag atgccttatg aaggttttgc 420 acattgattt atggaagtgt tatctttcat atgtccgaga aaccaagggt aaactaccaa 480 gttacaaaga aaaaatggct caagcatatg actttgcact ggataaaatt ggaatggaaa 540 ttatgtccta tcagatttgg gtggattaca tcaatttcct aaaaggcgtg gaagctgtag 600 gatcttatgc agaaaatcaa agaataacag ctgtccgaag agtttatcaa cgaggttgtg 660 ttaatccgat gatcaacatt gaacagctct ggagagacta taacaagtat gaagagggta 720 tcaatattca tttagctaaa aaaatgattg aagatcggag tagagattat atgaatgcta 780 gacgtgtagc aaaggaatat gagacagtaa tgaaaggctt ggaccgtaat gctccctcgg 840 tgcctcctca gaatactcct caagaagctc aacaagtaga tatgtggaag aaatatatac 900 agtgggaaaa gagcaaccct cttcgtacag aggatcagac ccttataaca aaaagagtta 960 tgtttgctta tgaacagtgc ctgcttgtgc tgggccatca ccctgatatt tggtatgaag 1020 ctgcccagta tcttgagcag tcaagtaaac tgctcgcaga aaagggagat atgaataatg 1080 ccaaattatt tagtgatgaa gctgctaata tatatgaaag agccataagc actttattga 1140 agaagaatat gcttctttat tttgcatatg cagattatga agagagtcgc atgaagtatg 1200 aaaaggttca cagtatatat aacagacttc tggcaattga ggatattgac cctaccttgg 1260 tatatatcca atatatgaaa tttgcacgga gagcagaagg catcaaatct ggaagaatga 1320 tatttaaaaa agcaagagaa gataccagaa cccgccacca tgtctatgtt actgcagcac 1380 tcatggaata ttactgtagt aaggacaaat ctgttgcctt taagattttt gagctggggc 1440 taaaaaaata tggagacatt ccagagtatg tcctggccta tattgactat ctttctcacc 1500 tcaatgagga caataatacc cgagttttgt ttgaacgagt tttaacatct ggaagccttc 1560 ctcctgagaa gtctggagaa atctgggccc gatttctagc atttgaaagt aatattggtg 1620 atctagctag tatactcaaa gtggagaaaa gacggtttac agcattcaaa gaagagtatg 1680 aagggaaaga aacggcttta ctagtagata gatacaagtt catggattta tatccttgct 1740 ctgcaagtga attaaaagca cttggttata aggatgtctc ccgtgctaag ctagcagcta 1800 taattccgga cccagttgta gctccttcta tagtgcctgt tctgaaagat gaagtggata 1860 gaaaaccaga ataccctaaa ccagacactc agcagatgat tccatttcag ccacgacatt 1920 tagcacctcc aggtttacac cctgtacctg gtggagtgtt cccagtccct cctgcagctg 1980 ttgttttaat gaaacttctc cctcctccta tctgtttcca gggtcctttt gtacaagtgg 2040 atgaactgat ggaaattttc cgaagatgca agataccaaa tactgttgag gaagctgtga 2100 ggatcattac tggtggggcc ccagagctag ctgtagaagg caacggcccc gtggaaagta 2160 atgcagtact caccaaggcc gtcaaaaggc ccaacgagga ttcagatgaa gatgaagaaa 2220 agggagccgt tgtcccccct gttcatgaca tttacagagc acggcagcag aagcggattc 2280 ggtagggttt taaacgcctc tacagaaaac tcctgtccag gattcctttt gcctcaagtg 2340 gtatgtttaa aagagacaac gctttgttac aaggttcttg gaaacaaagt tgtattgtca 2400 ttggtgcctc tatcacatgg ttcttgagaa aaaacaaacc aacctgtgtg aattttagaa 2460 tacggaacag acctatgctc taagcaaaat taggttttca aaaatgtgag aacagtacaa 2520 agtggcagaa ccacattttg ttccctcttc aagggtgtct tgtatgtgcc gcttgaagat 2580 ttgtgagttt ttcaacagtt ttattttaaa aactggatgg cttatgattg taaagcattt 2640 tatcacattt tctgaaaaca attgttcttg gtttgcttat gtagagtcct gccttattgt 2700 ttgtttttat ttatggcaga atgtatgaaa tccgttttgt agtttcaaat tttaaaagtc 2760 ctttaa 2766 266 4643 DNA Homo sapiens 266 ggcacgaggc aggatgtaga gtgctgttca ggttctccag cggagtcccc gaaggggcca 60 gcttcattga aagcttctgc acagtgagag gaggagttac agtacctgac tcggggctgc 120 tctgaatcca gtgctgctca gccgggaaga tactttccaa gcgttatgaa ggcggagaag 180 gatcccgaag acgaggaaaa tatccttaga gatccaagct aagtgtagcc cagcatgaac 240 aggaagattt gtaaagagag ggacggattc accggctagg agtttgtctg agggcgtgct 300 ttgtgagccg cagagatttg taacttaatg ccaagtagtt tgctgctagc aaccagaaac 360 caaatcctgt ctatgatgaa ctgttggttt tcttgtgctc ccaagaacag acatgcagca 420 gattggaaca aatatgatga ccgattgatg aaagccgcgg agaggggaga tgtagaaaaa 480 gtttcctcaa tccttgctaa aaagggcatc aatccaggca aactagatgt ggaaggcaga 540 tctgccttcc atgttgtggc ctcaaagggg aatcttgaat gtttgaatgc catccttata 600 catggagttg atattacaac cagtgacact gcaggaagaa atgctcttca cttggctgca 660 aagtatgggc atgcattgtg tctacaaaaa cttctacagt acaattgtcc cactgaacat 720 gcagacctgc agggaagaac cgcacttcat gacgcagcaa tggcagactg tccttccagc 780 atacagctgc tttgtgacca tggggcctcc gtgaatgcca aagatgtgga tgggcggaca 840 ccgctggttc tggctactca gatgtgtagg ccagcaatct gtcaactgct gatagatcga 900 ggggcagaga ttaattccag agacaaacaa aacagaactg ctctcatgct tggttgcgag 960 tatggttgta aggatgctgt agaagtctta cttaaaaatg gtgctgatgt aagcctgctg 1020 gatgccttgg gccatgatag ttcttactat gcaagaattg gtgacaatct ggacattcta 1080 actttattga agactgcgtc agaaaatacc aacaaaggga gagaactttg gaagaaagga 1140 ccatctttac agcagcgaaa tttgccgtac atgctagatg aagtaaatgt gaagtcaagt 1200 cagagggagc atcgaaacat tcaggagctg gagattgaaa atgaagattt gaaagacagg 1260 ttgagaaaaa ttcagcaaga acagagaata ttactggata aagtcaatgg tttacaacta 1320 cagctgaatg aggaagtgat ggttgctgat gatctggaaa gtgagaaaga aaagctgaag 1380 tctcttttgg tggctaaaga aaagcaacat gaagaaagcc taagaactat tgagtctctg 1440 aaaaacagat ttaaatattt tgagtgtact tccccagggg tgccagccca catgcaaagc 1500 aggtctatgt taagaccact ggagctatca ttacccaatc aaacctcata ttctgaaaat 1560 gacctcttaa agaaagagtt agaagcaatg agaactttct gcgaatcagc caaacaagac 1620 cgcctcaagc tccagaacga gctggcgcac aaggtggctg agtgcaaagc tttaggacta 1680 gaatgtgaac gcatcaagga ggactctgat gagcagataa agcagttaga agacgcattg 1740 aaagatgtgc agaagagaat gtatgagtcg gaaggtaaag taaaacaaat gcagacacac 1800 tttcttgccc ttaaagagca cctgaccagt gaagcagcta tagggaatca cagactaatg 1860 gaggagctga aggatcagtt gaaggacatg aaagcgaaat atgagggtgc atcagcagaa 1920 gtgggaaaac tgcgaaacca aatcaaacaa aatgagctgc tagtagaaca gtttaggaga 1980 gatgaaggca agctggtgga agagaataag cgattgcaga aggaactcag tatgtgtgaa 2040 acggagcgag acaagaaagg aaggagggtt gctgaggtgg aaggccaggt aaaggaactc 2100 ttagcaaagc tgaccttgtc agttccaact gaaaaatttg agagcatgaa gagcttatta 2160 tcaagcgaag taaatgagaa ggtgaaaaaa attggagaga cagaaagaga gtatgaaaaa 2220 tcacttactg aaatcagaca gttaaggaga gagcttgaga attgtaaggc caaacttgct 2280 cagcatgtca agccagagga gcatgagcag ctcaagagca gactggagca aagagcagga 2340 gaacttgcaa agaaggtcac ggaactcacg tcgaaaaatc aggtgttgca aagggacgtt 2400 gaaaaggttt atctggataa taagctcctc aatcagcaag tacataattt aacaagtgaa 2460 ataaaaagtc attatgttcc cctacaagtg agtgaagaaa tgaaaaagtc acatgatgtc 2520 accgtcgagg aactgaagaa acagctttta gatgtcacgc aaaaatgcgc agacaagcag 2580 ctggaaatgg agaaattgct gttggaaaat gacagtttaa gtaagaacgt tagccgccta 2640 gaaactgtat ttgtgcctcc tgagaaacac caaaaagagg tcacagctct gaaatccagc 2700 gtcgctgacc tcaaacgaca gctgttggaa ctgaacaaga agtgtgggga agaccgagag 2760 aaaataaacg ccctcgtgtc ggaaaacact agcttgaaaa agaccctgag taatcagtat 2820 gtgccggcta agacccacga ggaggttaag accgcgctga gtggcacgct ggataagacc 2880 aatagagaat tactagatgc gaagaagaaa tgggaagatc tcaatcagga atttgtaaaa 2940 acaaaagatg agaatgaaat cctcaaaaga aacctggaaa acactcagag ccaaataaaa 3000 gccgagtaca tcagcctccg tgagcatgaa gagaagatga gtgccataaa tcagaacatg 3060 aagagtgtac aggataacag tgcagaaata ctggccaact acagaaaggg ccaagaggag 3120 attgtgacac tacacgcaga aattgaagcc cagaaaaagg aacttgacac catccaagaa 3180 tgcattaagc tcaaatatgc tcctattatc agcttcgaag agtgcgagag aaaatttaaa 3240 gccacagaga aagaactaaa agaacagtta tcggagcaga tgcaaaaata tcacgtcagg 3300 gaagaagagg ccaagaagta caagcaagag aatgacaagt tgaagaagga gattttcact 3360 cttcagaagg atttaaaaga taagaatgtt ctcatcgaga actctcatga catggaaaga 3420 gcactcaaca gaaaagcaga agagctcaac aaacagttga aagacctgtt gcagaagtac 3480 agtgagataa agactgagaa ggagaagctg gttgacgaca atgccagaca gacttctgag 3540 cttcttgcag cccagaccct tctgcaaaag caacatgttc cattggaaca agttgagacc 3600 ctgaaaaaat ctcttaacag cacaattgag catctcaagg aagaactgaa gaataagcaa 3660 aagtgttatg agaaagagca gcagacagtg gccaaactgc atcagatgct agagaaccaa 3720 aagaactctt cagtgcccct gggagagcat ctgcgggtta aggaagcctt tgagaaggaa 3780 gtgggcatga taaaggccag cctgagggaa aaggaagaag aaagccaaaa caaaaccgaa 3840 gaggtctcca aactgcagtc tgaggttcag gacacaaaac aagcattaca aaaactagag 3900 acaagagagg tagttgattt gtctaaatat aaagcaacaa aaagtgatct ggagacccag 3960 atttccaacc tgaatgaaaa actggccaat ctgaatcgga agtatgaaga agcctgcgag 4020 gaggtgctgc gtgcccaaag gaagcaactg tctgccaaag atgagaagga attgctacat 4080 ttcagcattg agcaagaaat caaggatcag caggagcggt gtgataagtc cttaacaaca 4140 atcacagagt tacagaaaag aatacaggaa tctgccaaac agatcgaagc gaaagataat 4200 aagataacag aactgcttaa tgatgtggaa agactcaaac aggcactcag tggcctttca 4260 cagctcactt ccccgagcgg gagtcccagc aagaggcaga gccagctgat tgacaccctg 4320 cagcaccaag tgaagtctct acagcagcag ctggctgaca ctgacagaca gcaccaagag 4380 gtgattgcga tttatcggac acaccttctg agtgctgcac agggtcacat ggatgaagat 4440 gtgcaggcgg ccttactcca gatcatacag atgaggcagg ggctcgtgtg ctagcaggca 4500 gtgctgactg gctggtgtgt gctttggctg atggtgccaa gcattcccct tgcaactcca 4560 tggcctttct gggccttgtg ctagtataat tgaaataaaa tatattttgt ttcatcaaaa 4620 aaaaaaaaaa aaaaaaaaaa aaa 4643 267 993 DNA Homo sapiens 267 gggcgccggg gccgcggccg ggccggctcc agggccggcc gccgcggtgg ggcgcaggcc 60 gccgcgcgag tgaatcgagg cggcgggccc atccggaacc ggccggccat cgcccgcggc 120 gcggccggcg gaggcggcag gaaccgaccg gcgccctaca gcaggccaaa acaacttccc 180 gacaagtggc agcacgatct tttcgacagt ggcttcggcg gtggtgccgg cgtggagaca 240 ggtgggaaac tgctggtgtc caatctggat tttggagtct cagacgccga tattcaggaa 300 ctctttgctg aatttggaac gctgaagaag gcggctgtgc actatgatcg ctctggtcgc 360 agcttaggaa cagcaaacgt gcactttgag cggaaggcag atgccctgaa ggccatgaag 420 cagtacaacg gcttccctct ggatggccgc cccatgaaca ttcagcttgt cacgtcacag 480 attgacgcac agcggaggcc tgcacagagc gtaaacagag gtggcatgac tagaaaccgt 540 ggcgctggag gttttggtgg tggtggaggc acccggagag gcacccgcgg aggcgcccgt 600 ggaagaggca gaggtgccgg caggaattca aagcagcagc tttcggcaga ggagctggat 660 gcccagctgg acgcctataa tgcgagaatg gacaccagtt aaacagacca gcaaatccgc 720 gtgcggaaca ggacccaggc gtctcctctt gctccctggt tggggggcgg tggctggggc 780 tgtgcgccca atgatggatt tgtttctttt atgttttaaa ataggattta aaaactcatg 840 taaaggtttt ttttttttct tttttttttt tttaattctg aaacagacct gttttgtacc 900 gagttatttt tgggataaat tttactggtt gctgttgtgg agaaggtggc gtttccaccg 960 gaattccaaa aaaaaaaaaa aaaaaaaaaa aaa 993 268 3294 DNA Homo sapiens 268 ctggcgccgc cgcgcagcac ggctcagacc gaggcgcaca ggctcgcagc tccgcggcgc 60 ctagcgctcc ggtccccgcc gcgacgcgcc accgtccctg ccggcgcctc cgcgcgcttc 120 gaaatgaggg tcctgggtgg gcgctgcggg gcgttgctgg cgtgtctcct cctagtgctt 180 cccgtctcag aggcaaactt tttgtcaaag caacaggctt cacaagtcct ggttaggaag 240 cgtcgtgcaa attctttact tgaagaaacc aaacagggta atcttgaaag agaatgcatc 300 gaagaactgt gcaataaaga agaagccagg gaggtctttg aaaatgaccc ggaaacggat 360 tatttttatc caaaatactt agtttgtctt cgctcttttc aaactgggtt attcactgct 420 gcacgtcagt caactaatgc ttatcctgac ctaagaagct gtgtcaatgc cattccagac 480 cagtgtagtc ctctgccatg caatgaagat ggatatatga gctgcaaaga tggaaaagct 540 tcttttactt gcacttgtaa accaggttgg caaggagaaa agtgtgaatt tgacataaat 600 gaatgcaaag atccctcaaa tataaatgga ggttgcagtc aaatttgtga taatacacct 660 ggaagttacc actgttcctg taaaaatggt tttgttatgc tttcaaataa gaaagattgt 720 aaagatgtgg atgaatgctc tttgaagcca agcatttgtg gcacagctgt gtgcaagaac 780 atcccaggag attttgaatg tgaatgcccc gaaggctaca gatataatct caaatcaaag 840 tcttgtgaag atatagatga atgctctgag aacatgtgtg ctcagctttg tgtcaattac 900 cctggaggtt acacttgcta ttgtgatggg aagaaaggat tcaaacttgc ccaagatcag 960 aagagttgtg aggttgtttc agtgtgcctt cccttgaacc ttgacacaaa gtatgaatta 1020 ctttacttgg cggagcagtt tgcaggggtt gttttatatt taaaatttcg tttgccagaa 1080 atcagcagat tttcagcaga atttgatttc cggacatatg attcagaagg cgtgatactg 1140 tacgcagaat ctatcgatca ctcagcgtgg ctcctgattg cacttcgtgg tggaaagatt 1200 gaagttcagc ttaagaatga acatacatcc aaaatcacaa ctggaggtga tgttattaat 1260 aatggtctat ggaatatggt gtctgtggaa gaattagaac atagtattag cattaaaata 1320 gctaaagaag ctgtgatgga tataaataaa cctggacccc tttttaagcc ggaaaatgga 1380 ttgctggaaa ccaaagtata ctttgcagga ttccctcgga aagtggaaag tgaactcatt 1440 aaaccgatta accctcgtct agatggatgt atacgaagct ggaatttgat gaagcaagga 1500 gcttctggaa taaaggaaat tattcaagaa aaacaaaata agcattgcct ggttactgtg 1560 gagaagggct cctactatcc tggttctgga attgctcaat ttcacataga ttataataat 1620 gtatccagtg ctgagggttg gcatgtaaat gtgaccttga atattcgtcc atccacgggc 1680 actggtgtta tgcttgcctt ggtttctggt aacaacacag tgccctttgc tgtgtccttg 1740 gtggactcca cctctgaaaa atcacaggat attctgttat ctgttgaaaa tactgtaata 1800 tatcggatac aggccctaag tctatgttcc gatcaacaat ctcatctgga atttagagtc 1860 aacagaaaca atctggagtt gtcgacacca cttaaaatag aaaccatctc ccatgaagac 1920 cttcaaagac aacttgccgt cttggacaaa gcaatgaaag caaaagtggc cacatacctg 1980 ggtggccttc cagatgttcc attcagtgcc acaccagtga atgcctttta taatggctgc 2040 atggaagtga atattaatgg tgtacagttg gatctggatg aagccatttc taaacataat 2100 gatattagag ctcactcatg tccatcagtt tggaaaaaga caaagaattc ttaaggcatc 2160 ttttctctgc ttataatacc ttttccttgt gtgtaattat acttatgttt caataacagc 2220 tgaagggttt tatttacaat gtgcagtctt tgattatttt gtggtccttt cctgggattt 2280 ttaaaaggtc ctttgtcaag gaaaaaaatt ctgttgtgat ataaatcaca gtaaagaaat 2340 tcttacttct cttgctatct aagaatagtg aaaaataaca attttaaatt tgaatttttt 2400 tcctacaaat gacagtttca atttttgttt gtaaaactaa atttttaatt ttatcatcat 2460 gaactagtgt ctaaatacct atgttttttt cagaaagcaa ggaagtaaac tcaaacaaaa 2520 gtgcgtgtaa ttaaatacta ttaatcatag gcagatacta ttttgtttat gtttttgttt 2580 ttttcctgat gaaggcagaa gagatggtgg tctattaaat atgaattgaa tggagggtcc 2640 taatgcctta tttcaaaaca attcctcagg gggaccagct ttggcttcat ctttctcttg 2700 tgtggcttca catttaaacc agtatcttta ttgaattaga aaacaagtgg gacatatttt 2760 cctgagagca gcacaggaat cttcttcttg gcagctgcag tctgtcagga tgagatatca 2820 gattaggttg gataggtggg gaaatctgaa gtgggtacat tttttaaatt ttgctgtgtg 2880 ggtcacacaa ggtctacatt acaaaagaca gaattcaggg atggaaagga gaatgaacaa 2940 atgtgggagt tcatagtttt ccttgaatcc aacttttaat taccagagta agttgccaaa 3000 atgtgattgt tgaagtacaa aaggaactat gaaaaccaga acaaatttta acaaaaggac 3060 aaccacagag ggatatagtg aatatcgtat cattgtaatc aaagaagtaa ggaggtaaga 3120 ttgccacgtg cctgctggta ctgtgatgca tttcaagtgg cagttttatc acgtttgaat 3180 ctaccattca tagccagatg tgtatcagat gtttcactga cagtttttaa caataaattc 3240 ttttcactgt attttatatc acttataata aatcggtgta taatctaaaa aaaa 3294 269 1412 DNA Homo sapiens 269 caccgtcgtc cgcaaagcct gagtcctgtc ctttctctct ccccggacag catgagcttc 60 accactcgct ccaccttctc caccaactac cggtccctgg gctctgtcca ggcgcccagc 120 tacggcgccc ggccggtcag cagcgcggcc agcgtctatg caggcgctgg gggctctggt 180 tcccggatct ccgtgtcccg ctccaccagc ttcaggggcg gcatggggtc cgggggcctg 240 gccaccggga tagccggggg tctggcagga atgggaggca tccagaacga gaaggagacc 300 atgcaaagcc tgaacgaccg cctggcctct tacctggaca gagtgaggag cctggagacc 360 gagaaccgga ggctggagag caaaatccgg gagcacttgg agaagaaggg accccaggtc 420 agagactgga gccattactt caagatcatc gaggacctga gggctcagat cttcgcaaat 480 actgtggaca atgcccgcat cgttctgcag attgacaatg cccgtcttgc tgctgatgac 540 tttagagtca agtatgagac agagctggcc atgcgccagt ctgtggagaa cgacatccat 600 gggctccgca aggtcattga tgacaccaat atcacacgac tgcagctgga gacagagatc 660 gaggctctca aggaggagct gctcttcatg aagaagaacc acgaagagga agtaaaaggc 720 ctacaagccc agattgccag ctctgggttg accgtggagg tagatgcccc caaatctcag 780 gacctcgcca agatcatggc agacatccgg gcccaatatg acgagctggc tcggaagaac 840 cgagaggagc tagacaagta ctggtctcag cagattgagg agagcaccac agtggtcacc 900 acacagtctg ctgaggttgg agctgctgag acgacgctca cagagctgag acgtacagtc 960 cagtccttgg agatcgacct ggactccatg agaaatctga aggccagctt ggagaacagc 1020 ctgagggagg tggaggcccg ctacgcccta cagatggagc agctcaacgg gatcctgctg 1080 caccttgagt cagagctggc acagacccgg gcagagggac agcgccaggc ccaggagtat 1140 gaggccctgc tgaacatcaa ggtcaagctg gaggctgaga tcgccaccta ccgccgcctg 1200 ctggaagatg gcgaggactt taatcttggt gatgccttgg acagcagcaa ctccatgcaa 1260 accatccaaa agaccaccac ccgccggata gtggatggca aagtggtgtc tgagaccaat 1320 gacaccaaag ttctgaggca ttaagccagc agaagcaggg taccctttgg ggagcaggag 1380 gccaataaaa agttcagagt tcattggatg tc 1412 270 1163 DNA Homo sapiens 270 ccggcggcgc ctcaggtcgc ggggcgccta ggcctgggtt gtcctttgca tctgcacgtg 60 ttcgcagtcg tttccgcgat gctgcctctg ctgcgctgcg tgccccgtgt gctgggctcc 120 tccgtcgccg gcctccgcgc tgccgcgccc gcctcgcctt tccggcagct cctgcagccg 180 gcaccccggc tgtgcacccg gcccttcggg ctgctcagcg tgcgcgcagg ttccgagcgg 240 cggccgggcc tcctgcggcc tcgcggaccc tgcgcctgtg gctgtggctg cggctcgctg 300 cacaccgacg gagacaaagc ttttgttgat ttcctgagtg atgaaattaa ggaggaaaga 360 aaaattcaga agcataaaac cctccctaag atgtctggag gttgggagct ggaactgaat 420 gggacagaag cgaaattagt gcggaaagtt gccggggaaa aaatcacggt cactttcaac 480 attaacaaca gcatcccacc aacatttgat ggtgaggagg aaccctcgca agggcagaag 540 gttgaagaac aggagcctga actgacatca actcccaatt tcgtggttga agttataaag 600 aatgatgatg gcaagaaggc ccttgtgttg gactgtcatt atccagagga tgaggttgga 660 caagaagacg aggctgagag tgacatcttc tctatcaggg aagttagctt tcagtccact 720 ggcgagtctg aatggaagga tactaattat acactcaaca cagattcctt ggactgggcc 780 ttatatgacc acctaatgga tttccttgcc gaccgagggg tggacaacac ttttgcagat 840 gagctggtgg agctcagcac agccctggag caccaggagt acattacttt tcttgaagac 900 ctcaagagtt ttgtcaagag ccagtagagc agacagatgc tgaaagccat agtttcatgg 960 caggctttgg ccagtgaaca aatcctactc tgaagctaga catgtgcttt gaaatgatta 1020 tcatcctaat atcatggggg aaaaaatacc aaatttaaat tatatgtttt gtgttctcat 1080 ttattatcat ttttttctgt acaaatctat tatttctaga tttttgtata acatgataga 1140 cataaaattg gtttatctcc tcc 1163 271 1873 DNA Homo sapiens 271 gcagcttcca aaggccagat gatggaggag cgtgccaacc taatgcacat gatgaaactc 60 agcattaagg tcttactcca gtctgccctt agcctgggac gcagtctgga tgcagactat 120 gcccccttgc agcagttctt tgtagtgatg gagcattgcc tcaaacatgg gctgaaagtc 180 aagaagagtt tcattggcca aaataagtct ttctttggtc ctttggagct ggtggaaaaa 240 ctttgtcctg aagcatcaga tatagctacc agtgtcagga atcttccaga actaaagaca 300 gccgtgggaa gaggcagagc gtggctttat cttgctctca tgcaaaagaa actggcagat 360 tatctgaaag tcctcataga caacaagcag ctcctaagtg agttctatga gcccgaggcc 420 ttaatgatgg aggaggaagg aatggtgatc gtgggtctgc tggtgggact caacgttctg 480 gatgccaacc tctgcttaaa gggagaagac ttggattctc aggtcggggt gattgacttc 540 tctctctgtc tgaaggacgc acaggatctt gacagcggca gagagcatga aagaattacc 600 gatgtccttg atcagaaaaa ttacgtggag gaacttaacc ggcacctaag ctgcacagtt 660 ggggatcttc agacgaagat agatggcttg gaaaagacga actcaaagct tcaagaagag 720 ctttcagctg caacagaccg aatttgttcc ctacaaaaag aacaacagca gctgagagaa 780 cagaacgaag taattcgaga aagaagcgag aagagcgtgg agataacaaa acaggacacc 840 aaggttgaac tggagactta taagcagacc cggcagggcc tggatgaaat gtacagcgat 900 gtgtggaaac agctgaagga ggagaagaaa gtccgactgg aactggaaaa ggagctggag 960 ttgcagattg gaatgaagac ggagatggag atcgccatga agttgctgga gaaggacacc 1020 catgagaagc aagacacgct cgttgccctc cgccagcagc tggaggaagt caaagctatt 1080 aacttacaga tgtttcacaa agttcagagt gcagagagca gtttacagca gaagaatgaa 1140 gccatcgcat cttttgaaag aaaaaccact caggtgatgt ccagcatgaa gcaaatggag 1200 gaaaggttac agcaggcgga acgggccagg caggcggcag aagagcgaag ccacaagttg 1260 cagcaggagc tcagcgggag gggcagtgcc ctgcagctcc agctgtccca gctgcgggac 1320 caatgctcag gtctagagaa agaactgaag tcagaaaaag aacaaagaca agctctccag 1380 agagaactac agcgtgagaa agacacctct tgtctgctcc agacggagct gcagcaagtg 1440 gaaggcctga agaaggagct gcgggagctc caagacgaga aggcagagtt acggaaggtc 1500 tgtgaggagc aggagcaagc cctccaggaa atgggactac accttagcca gtccaaactg 1560 aagatggaag acataaagga agtaaataag gcactgaagg gccacacatg gttgaaagat 1620 gacgaggcaa cacattgtaa gcagtgtgag aaggacttct ccatttctcg gaggaagcac 1680 cactgccgaa actgtggcca cattttctgc aacacctgct ctagcaacga gctggccctg 1740 ccctcctacc ctaagccagt gcgtgtgtgt gacagctgcc ataccctgct actgcagcgc 1800 tgctcctcca ctgcctcctg aagccatcag ccccacccac cagcagggac cttgacctgt 1860 tgaagccgaa ttc 1873 272 1946 DNA Homo sapiens 272 aattcggggg ccgtggagtt tgtgacatac gaggtgacac ccctcgagtc acttcccttc 60 aactccagct ggagcgcctg cttggctttg ggttcgttct gcagccttcg ccccgctcct 120 agcctcaggg ccggactcca gcgcagagcc cagcccagcg cagcctgcca gcagccaccc 180 agccgcccag ccgcccagcc ccgcacgaaa cccggccaga gcttcctagc agcccgagcc 240 atgaacaccg aaatgtatca gacccccatg gaggtggcgg tctaccagct gcacaatttc 300 tccatctcct tcttctcttc tctgcttgga ggggatgtgg tttccgttaa gctggacaac 360 agtgcctccg gagccagcgt ggtggccata gacaacaaga tcgaacaggc catggatctg 420 gtgaagaatc atctgatgta tgctgtgaga gaggaggtgg agatcctgaa ggagcagatc 480 cgagagctgg tggagaagaa ctcccagcta gagcgtgaga acaccctgtt gaagaccctg 540 gcaagcccag agcagctgga gaagttccag tcctgtctga gccctgaaga gccagctccc 600 gaatccccac aagtgcccga ggcccctggt ggttctgcgg tgtaagtggc tctgtcctca 660 gggtgggcag agccactaaa cttgttttac ctagttcttt ccagtttgtt tttggctccc 720 caagcatcat ctcacgagga gaactttaca cctagcacag ctggtgccaa gagatgtcct 780 aaggacatgg ccacctgggt ccactccagc gacagacccc tgacaagagc aggtctctgg 840 aggctgagtt gcatggggcc tagtaacacc aagccagtga gcctctaatg ctactgcgcc 900 ctgggggctc ccagggcctg ggcaacttag ctgcaactgg caaaggagaa gggtagtttg 960 aggtgtgaca ccagtttgct ccagaaagtt taaggggtct gtttctcatc tccatggaca 1020 tcttcaacag cttcacctga caacgactgt tcctatgaag aagccacttg tgttttaagc 1080 agaggcaacc tctctcttct cctctgtttc gtgaaggcag gggacacaga tgggagagat 1140 tgagccaagt cagccttctg ttggttaata tggtataatg catggctttg tgcacagccc 1200 agtgtgggat tacagctttg ggatgaccgc ttacaaagtt ctgtttggtt agtattggca 1260 tagtttttct atatagccat aaatgcgtat atatacccat agggctagat ctgtatctta 1320 gtgtagcgat gtatacatat acacatccac ctacatgttg aagggcctaa ccagccttgg 1380 gagtattgac tggtccctta cctcttatgg ctaagtcttt gactgtgttc atttaccaag 1440 ttgacccagt ttgtctttta ggttaagtaa gaactcgaga gtaaaggcaa ggaggggggc 1500 cagcctctga atgcggccac ggatgccttg ctgctgcaac cctttcccca gctgtccact 1560 gaaacgtgaa gtcctgtttt gaatgccaaa cccaccattc actggtgctg actacataga 1620 atgggttgag agaagatcag tttgggcttc acagtgtcat ttgaaaaagc gtttttgttt 1680 tgttttgaat tattgtggaa aactttcaag tgaacagaag gatggtgtcc tactgtggat 1740 gagggatgaa caaggggatg gctttgatcc aatggagcct gggaggtgtg cccagaaagc 1800 ttgtctgtag cgggttttgt gagagtgaac actttccact ttttgacacc ttatcctgat 1860 gtatggttcc aggatttgga ttttgatttt ccaaatgtag cttgaaattt caataaactt 1920 tgctctgttt ttctaaaaaa taaaaa 1946 273 2795 DNA Homo sapiens 273 gagagttccc catctgaggc gtttgttgca gctacctgca cttctagatt catcttcttg 60 tgagccctgg gcttaggagt caccatggca actgaagagt tcatcatccg catcccccca 120 taccactata tccatgtgct ggaccagaac agcaacgtgt cccgtgtgga ggtcgggcca 180 aagacctaca tccggcagga caatgagagg gtactgtttg ccccatgcgc atggtgaccg 240 tccccccacg tcactactgc acagtggcca accctgtgtc tcgggatgcc caggcttggt 300 gctgtttgat gtcacagggc aagttcggct tcgccacgct gacctcgaga tccggctggc 360 ccaggacccc ttccccctgt acccagggga ggtgctggaa aaggacatca cacccctgca 420 ggtggttctg cccaacactg ccctccatct aaaggcgctg cttgattttg aggataaaga 480 tggagacaag gtggtggcag gagatgagtg gcttttcgag ggacctggca cgtacatccc 540 ccggaaggaa gtggaggtcg tggagatcat tcaggccacc atcatcaggc agaaccaggc 600 tctgcggctc agggcccgca aggagtgctg ggaccgggac ggcaaggaga gggtgacagg 660 ggaagaatgg ctggtcacca cagtaggggc gtacctccca gcggtgtttg aggaggttct 720 ggatttggtg gacgccgtca tccttacgga aaagacagcc ctgcacctcc gggctcggcg 780 gaacttccgg gacttcaggg gagtgtcccg ccgcactggg gaggagtggc tggtaacagt 840 gcaggacaca gaggcccacg tgccagatgt ccacgaggag gtgctggggg ttgtgcccat 900 caccaccctg ggcccccaca actactgcgt gattctcgac cctgtcggac cggatggcaa 960 gaatcagctg gggcagaagc gcgtggtcaa gggagagaag tcttttttcc tccagccagg 1020 agagcagctg gaacaaggca tccaggatgt gtatgtgctg tcggagcagc aggggctgct 1080 gctgagggcc ctgcagcccc tggaggaggg ggaggatgag gagaaggtct cacaccaggc 1140 tggggaccac tggctcatcc gcggacccct ggagtatgtg ccatctgcca aagtggaggt 1200 ggtggaggag cgccaggcca tccctctaga cgagaacgag ggcatctatg tgcaggatgt 1260 caagaccgga aaggtgcgcg ctgtgattgg aagcacctac atgctgaccc aggacgaagt 1320 cctgtgggag aaagagctgc ctcccggggt ggaggagctg ctgaacaagg ggcaggaccc 1380 tctggcagac aggggtgaga aggacacagc taagagcctc cagcccttgg cgccccggaa 1440 caagacctgt gtggtcagct accgcgtgcc ccacaacgct gcggtgcagg tgtacgacta 1500 ccgagagaag cgagcccgcg tggtcttcgg gcctgagctg gtgtcgctgg gtcctgagga 1560 gcagttcaca gtgttgtccc tctcagctgg gcggcccaag cgtccccatg cccgccgtgc 1620 gctctgcctg ctgctggggc ctgacttctt cacagacgtc atcaccatcg aaacggcgga 1680 tcatgccagg ctgcaactgc agctggccta caactggcac tttgaggtga atgaccggaa 1740 ggacccccaa gagacggcca agctcttttc agtgccagac tttgtaggtg atgcctgcaa 1800 agccatcgca tcccgggtgc ggggggccgt ggcctctgtc actttcgatg acttccataa 1860 gaactcagcc cgcatcattc gcactgctgt ctttggcttt gagacctcgg aagcgaaggg 1920 ccccgatggc atggccctgc ccaggccccg ggaccaggct gtcttccccc aaaacgggct 1980 ggtggtcagc agtgtggacg tgcagtcagt ggagcctgtg gatcagagga cccgggacgc 2040 cctgcaacgc agcgtccagc tggccatcga gatcaccacc aactcccagg aagcggcggc 2100 caagcatgag gctcagagac tggagcagga agcccgcggc cggcttgagc ggcagaagat 2160 cctggaccag tcagaagccg agaaagctcg caaggaactt ttggagctgg aggctctgag 2220 catggccgtg gagagcaccg ggactgccaa ggcggaggcc gagtcccgtg cggaggcagc 2280 ccggattgag ggagaagggt ccgtgctgca ggccaagcta aaagcacagg ccttggccat 2340 tgaaacggag gctgagctcc agagggtcca gaaggtccga gagctggaac tggtctatgc 2400 ccgggcccag ctggagctgg aggtgagcaa ggctcagcag ctggctgagg tggaggtgaa 2460 gaagttcaag cagatgacag aggccatagg ccccagcacc atcagggacc ttgctgtggc 2520 tgggcctgag atgcaggtaa aactgctcca gtccctgggc ctgaaatcaa ccctcatcac 2580 cgatggctcc actcccatca acctcttcaa cacagccttt gggctgctgg ggatggggcc 2640 cgagggtcag cccctgggca gaagggtggc cagtgggccc agccctgggg aggggatatc 2700 cccccagtct gctcaggccc ctcaagctcc tggagacaac cacgtggtgc ctgtactgcg 2760 ctaactcctg attaatacaa tggaagtttc tgggc 2795 274 4166 DNA Homo sapiens 274 cggcgcgggt gttgagagcg gtgtggtagg tgttgtagcc gctatggtga agttcgcttt 60 gtagcggccc cggctagaga gttggcctgt tccctgcctt tgtgacccgg aggagctttt 120 gggggtgcgt caagcccctg gcctgaggca gcgaactggt ttgtggcctg tttgattcct 180 gtcagaggtt tgctgaccca agacagtatc gaaaatgcat attaagtcaa ttattctaga 240 gggattcaag tcctatgctc agaggaccga agtcaatggt tttgaccccc tcttcaatgc 300 tatcactggc ttaaatggta gtgggaaatc caacatattg gactccatct gctttttact 360 gggcatctcc aacctgtctc aggttcgggc ttctaattta caagatttag tttacaaaaa 420 tgggcaggct ggtattacca aagcctctgt gtcaatcact tttgataatt ctgacaaaaa 480 gcaaagtcct ttaggatttg aggttcatga tgaaatcaca gtaacaaggc aggtggttat 540 tggtggtaga aataaatatt taatcaatgg agtcaatgcc aacaacacca gagtacagga 600 tctcttctgt tctgttggcc ttaatgttaa caaccctcac tttctcatca tgcagggccg 660 aattacaaaa gtattgaata tgaaaccacc agagatttta tccatgatag aagaagcagc 720 tggaaccagg atgtatgaat acaaaaaaat agctgcacag aaaactatag aaaaaaagga 780 ggctaagctg aaagaaatta agacgatact tgaagaagag attactccaa ccattcaaaa 840 attaaaagag gaaagatcgt cctacttgga gtaccaaaaa gtaatgagag aaatagaaca 900 tttgagtcgt ttatatattg cttatcagtt tttgctggct gaagatacca aagtacgctc 960 agctgaggaa ttaaaagaaa tgcaagataa agttataaag cttcaggaag aattgtctga 1020 gaatgataaa aaaataaaag cacttaatca tgaaatagaa gaattggaaa aaagaaaaga 1080 taaggaaact ggagttatac ttcgatcttt agaagatgct cttgcagagg ctcagcgagt 1140 taatactaaa tctcaaagcg catttgatct caagaagaaa aatctggcat gtgaggaaag 1200 caaacgcaaa gagctggaaa aaaatatggt tgaggactca aaaactttag cagcaaagga 1260 aaaagaggtt aaaaagataa cagatggact gcatgccctt caagaagcaa gtaataaaga 1320 tgctgaagct ctggcagctg cacagcagca cttcaatgct gtttccgctg gcctgtccag 1380 taatgaagat ggagcagaag caactcttgc tggtcaaatg atggcctgta aaaatgatat 1440 aagtaaagct cagacagaag ccaaacaggc tcagatgaag ttgaagcatg ctcaacagga 1500 attaaagaat aaacaagctg aagttaagaa gatggatagt ggctacagga aggatcaaga 1560 agctctagaa gctgtaaaaa gacttaaaga aaaacttgaa gctgaaatga aaaagctaaa 1620 ttatgaagaa aataaagagg aaagcctttt ggaaaagcgc aggcagctgt ctcgtgatat 1680 tggtagattg aaagaaacat atgaagctct attagccaga tttcccaatc ttcgatttgc 1740 atacaaggat ccagagaaga actggaatag aaattgtgtg aaaggacttg tggcttctct 1800 gattagtgtg aaagacactt ctgcaaccac agctttagaa ttagtggctg gagaacgact 1860 ctacaatgtt gtagtagaca cagaagttac tggtaaaaag ctactagaaa ggggggaact 1920 gaaacgtcga tacactataa ttccactcaa taaaatttca gccagatgta ttgcaccaga 1980 aactctgaga gttgctcaga atcttgttgg ccctgacaac gttcatgtgg ctctttcctt 2040 ggttgaatat aaaccagaac ttcagaaagc aatggagttt gtctttggaa caacatttgt 2100 ttgtgacaat atggataatg ccaaaaaagt ggcctttgat aagaggataa tgactagaac 2160 tgtaactctc ggaggtgatg tgtttgatcc tcatgggaca ttgagtggag gtgctcgatc 2220 ccaggcagct tccattttaa ccaagtttca agaactcaaa gatgttcagg atgaactgag 2280 aatcaaagag aatgagctgc gggctctaga agaggaatta gcaggtctta aaaacactgc 2340 tgaaaagtat cgccaactaa aacagcagtg ggagatgaaa actgaagagg cagatttatt 2400 acaaaccaag ctccagcaaa gctcatatca caagcaacaa gaagaattag atgcccttaa 2460 aaaaaccatt gaggaaagtg aggagacttt gaaaaacact aaagaaatcc aaagaaaagc 2520 agaagaaaaa tatgaagtat tggaaaataa aatgaaaaat gcagaagctg aaagagagcg 2580 agaactgaaa gatgctcaga aaaaactgga ttgtgccaaa acaaaggcag atgcatctag 2640 caagaagatg aaagaaaaac aacaggaagt tgaagctatc actctggaac tggaagagct 2700 caagagagag catacatctt acaaacaaca gcttgaagct gtaaatgaag ctatcaaatc 2760 ctatgaaagt cagattgaag taatggcagc tgaggtggct aaaaataagg agtcagtaaa 2820 taaagctcaa gaagaggtga ccaagcaaaa agaggtgata acagcccaag acactgtaat 2880 taagctaaat atgcagaagt ggcaaaacac aaggagcaaa acaatgattc tcagccttaa 2940 aattaaggaa ttagaccacc acatcagcaa acataaacgg gaggctgaag atggtgctgc 3000 aaaggtatcc aaaatgttga aagattatga ctggattaat gcagagagac acctctttgg 3060 ccaacccaat agtgcctatg atttcaaaac taacaaccct aaagaagctg gtcagagact 3120 tcagaagttg caagaaatga aggagaaact aggaagaaat gtcaatatga gagctatgaa 3180 tgtattgaca gaagctgaag agcgatgcaa tgacttgatg aagaagaaga gaattgtaga 3240 aaatgacaaa tccaaaattc ttacaactat agaagacctt gaccagaaga aaaaccaagc 3300 cctaaatatt gcatggcaaa aggtgaacaa ggactttggg tctatttttt ctactctttt 3360 gcctggtgct aatgctatgc ttgcaccacc agagggtcaa actgttttgg atggtctgga 3420 gttcaaggtt gccttaggaa atacctggaa agaaaaccta actgaactta gtggtggtca 3480 gaggtcttta gtggccttgt cattaatact gtccatgctt ctcttcaaac ctgctccaat 3540 ttatatcctt gatgaggtag atgcagcctt ggatctttct catacccaaa acattggaca 3600 gatgctgcgt actcatttca cacattctca gttcattgtg gtgtcactaa aagaaggtat 3660 gttcaacaat gcaaacgttc ttttcaaaac caagtttgtg gatggtgttt ctacagtagc 3720 cagatttact caatgtcaaa atggaaagat ttcaaaggaa gcaaaatcca aggcaaaacc 3780 acccaaagga gcacatgtgg aagtttaaac tacaaagtta tttcttcatc ttgacctgtt 3840 tttttaaatg taaactttta aggacttgag ataactaatt tgtttatata caaaaattaa 3900 tgttactgtg ttacttaacc catgttttct ctttatataa tcacttatcg cttacaaatg 3960 agcatatatt cctcatctct taactagtct aattatggtc caattattgt ggttgtgatt 4020 ttatgcatat ccatcaaaat gttttttttc ttatgcgggt cttttatata ttagggatcc 4080 tgagataccc gattctatat gtaaaagcta atatacaaaa aagcagatta aattacatga 4140 taaatgtagc tgaaaaaaaa aaaaaa 4166 275 1193 DNA Homo sapiens 275 actgttctcg cgttcgcgga cggctgtggt gttttggcgc atgggcggag cgtagttacg 60 gtcgactggg gcgtcgtccc tagcccggga gccgggtctc tggagtcgcg gcccggggtt 120 cacgatgtcc gacgaggaag cgaggcagag cggaggctcc tcgcaggccg gcgtcgtgac 180 tgtcagcgac gtccaggagc tgatgcggcg caaggaggag atagaagcgc agatcaaggc 240 caactatgac gtgctggaaa gccaaaaagg cattgggatg aacgagccgc tggtggactg 300 tgagggctac ccccggtcag acgtggacct gtaccaagtc cgcaccgcca ggcacaacat 360 catatgcctg cagaatgatc acaaggcagt gatgaagcag gtggaggagg ccctgcacca 420 gctgcacgct cgcgacaagg agaagcaggc ccgggacatg gctgaggccc acaaagaggc 480 catgagccgc aaactgggtc agagtgagag ccagggccct ccacgggcct tcgccaaagt 540 gaacagcatc agccccggct ccccagccag catcgcgggt ctgcaagtgg atgatgagat 600 tgtggagttc ggctctgtga acacccagaa cttccagtca ctgcataaca ttggcagtgt 660 ggtgcagcac agtgaggggg ccctggcacc caccatccta ctttctgtct ctatgaattt 720 gactactcca gggacctcat ctagaagccc ctgaatgtga cagtgatccg caggggggaa 780 aaacaccagc ttagacttgt tccaacacgc tgggcaggaa aaggactgct gggctgcaac 840 attattcctc tgcaaagatg attgtccctg gggaacagta acaggaaagc atcttccctt 900 gccctggact tgggtctagg gatttccaac ttgtcttctc tccctgaagc ataaggatct 960 ggaagaggct tgtaacctga acttctgtgt ggtggcagta ctgtggccca ccagtgtaat 1020 ctccctggat taaggcattc ttaaaaactt aggcttggcc tctttcacaa attaggccac 1080 ggccctaaat aggaattccc tggattgtgg gcaagtgggc ggaagttatt ctggcaggta 1140 ctggtgtgat tattattatt atttttaata aagagtttta cagtgctgat atg 1193 276 1155 DNA Homo sapiens 276 gataaagttt cctgtagtga aagtcagtta caaagccagt gtgaacaaat gaaacagaca 60 aatattaatt tggaaagtag gttgttgaaa gaggaagaac tgcgaaaaga ggaagtccaa 120 actctgcaag ctgaactcgc ttgtagacaa acagaagtta aagcattgag tacccaggta 180 gaagaattaa aagatgagtt agtaactcag agacgtaaac atgcctctag tatcaaggat 240 yctcaccaaac aacttcagca agcacgaaga aaattagatc aggttgagag tggaagctat 300 gacaaagaag tcagcagcat gggaagtcgt tctagttcat cagggtccct gaatgctcga 360 agcagtgcag aagatcgatc tccagaaaat actgggtcct cagtagctgt ggataacttt 420 ccacaagtag ataaggccat gttgattgag agaatagtta ggctgcaaaa agcacatgcc 480 cggaaaaatg aaaagataga atttatggag gaccacatca aacaactggt ggaagaaatt 540 aggaaaaaaa caaaaataat tcaaagttat attttacgag aagaatcagg cacactttct 600 tcagaggcat ctgattttaa caaagttcat ttaagtagac ggggtggcat catggcatct 660 ttatatacat cccatccagc tgacaatgga ttaacattgg agctctcttt ggaaatcaac 720 cgaaaattac aggctgtttt ggaggatacg ttactaaaaa atattacttt gaaggaaaat 780 ctacaaacac ttggaacaga aatagaacgt cttattaaac accagcatga actagaacag 840 aggacaaaga aaacctaaaa caagcctctt gctcagtaaa gagacaaaag ccacacagga 900 gtaggtgcca ctgacctcta ttgttggaga ctttgttcca ctttttgttt cagccagtaa 960 aaatattgtt ttgcttcatc tgtacacaaa aaaataccct tttacaatat gaatgcattg 1020 ctgtatatac tgtaagactg aaagctttga tgaaatttgt ttttgtatgg tgcaatatga 1080 cagcctgtca ttgaatctaa acaacttaat ttgcttgtat tcataagaag tgttgaacat 1140 tacaagggct tttat 1155 277 5850 DNA Homo sapiens 277 caatcccaca gagtattgat gaggaaactg aagtttggag cgatcacatc attttcccaa 60 ggtaacacaa gtggcaagac agccgggaac ccctacccca tccccttatt cagcacatga 120 aataaacaag gggcatccaa atcttgcggc aacgcccccg ggacatgcat cgtcccctgg 180 actctctcaa accccttatc cctctggaca gaatgcaggt ccaaccacgc tggtataccc 240 tcaaacccct cagacaatga attcacaacc tcaaacccgt tctccgtttt tccagaggcc 300 tcaaatacag cctcctagag ctaccatccc gaacagcagt ccttccattc gtcctggtgc 360 acagacaccc actgcagtgt accaggctaa tcagcacatc atgatggtta accatctgcc 420 catgccgtac ccagtgcccc aggggcctca gtactgtata ccacagtacc gtcatagtgg 480 ccctccttat gttgggcccc cccaacaata tccagttcaa ccaccggggc caggtccttt 540 ttatcctgga ccaggacctg gggacttccc caatgcttat ggaacgcctt tttacccaag 600 tcagccggtg tatcagtcag cacctatcat agtgcctacg cagcaacagc cgcctccagc 660 caagagagag aaaaaaacta taagaattcg ggatccaaac cagggaggta aagacataac 720 agaggagatt atgtctggag gtggcagcag aaatcctact ccacccatag gaagacccac 780 gtccacacct actcctcctc agcagctgcc cagccaggtc cccgagcaca gccctgtggt 840 ttatgggact gtggagagcg ctcatcttgc tgccagcacc cctgtcactg cagctagcga 900 ccagaagcaa gaggagaagc caaaaccaga tccagtgtta aagtctcctt ccccagtcct 960 taggctagtc ctcagtggag agaagaaaga acaagaaggc cagacatctg aaactactgc 1020 aatagtatcc atagcagagc ttcctctgcc tccatcacct accactgttt cttctgttgc 1080 tcgaagtaca attgcagccc ccacctcttc tgctcttagt agccaaccaa tattcaccac 1140 tgctatagat gacagatgtg aactctcatc cccaagagaa gacacaattc ctatacccag 1200 cctcacatct tgcacagaaa catcagaccc tttaccaaca aatgaaaatg atgatgatat 1260 atgcaagaaa ccctgtagtg tagcacctaa tgatattcca ctggtttcta gtactaacct 1320 aattaatgaa ataaatggag ttagcgaaaa attatcagcc acggagagca ttgtggaaat 1380 agtaaaacag gaagtattgc cattgactct tgaattggag attctcgaaa atcccccaga 1440 agaaatgaaa ctggagtgta tcccagctcc catcacccct tccacagttc cttcctttcc 1500 tccaactcct ccaactcctc cagcttctcc tcctcacact ccagtcattg ttcctgctgc 1560 tgccactact gttagttctc cgagtgctgc catcacagtc cagagagtcc tagaggagga 1620 cgagagcata agaacttgcc ttagtgaaga tgcaaaagag attcagaaca aaatagaggt 1680 agaagcagat gggcaaacag aagagatttt ggattctcaa aacttaaatt caagaaggag 1740 ccctgtccca gctcaaatag ctataactgt accaaagaca tggaagaaac caaaagatcg 1800 gacccgaacc actgaagaga tgttagaggc agaattggag cttaaagctg aagaggagct 1860 ttccattgac aaagtacttg aatctgaaca agataaaatg agccaggggt ttcatcctga 1920 aagagacccc tctgacctaa aaaaagtgaa agctgtggaa gaaaatggag aagaagctga 1980 gccagtacgt aatggtgctg agagtgtttc tgagggtgaa ggaatagatg ctaattcagg 2040 ctccacagat agttctggtg atggggttac atttccattt aaaccagaat cctggaagcc 2100 tactgatact gaaggtaaga agcagtatga cagggagttt ctgctggact tccagttcat 2160 gcctgcctgt atacaaaaac cagagggcct gcctcctatc agtgatgtgg ttcttgacaa 2220 gatcaaccaa cccaaattgc caatgcgaac tctggatcct cgaattttgc ctcgaggacc 2280 agactttaca ccagcctttg ctgattttgg aaggcagaca cctggtggaa gaggcgtacc 2340 tttgttgaat gttgggtcac gaagatctca acctggccaa agaagagaac ccagaaagat 2400 catcacagtt tctgtaaaag aagatgtaca cctgaaaaag gcagaaaatg cctggaagcc 2460 aagccaaaaa cgagacagcc aagccgatga tcccgaaaac attaaaaccc aggagctttt 2520 tagaaaagtt cgaagtatct taaataaatt gacaccacag atgttcaatc aactgatgaa 2580 gcaagtgtca ggacttactg ttgacacaga ggagcggctg aaaggagtta ttgacctggt 2640 ctttgagaag gctattgatg aacccagttt ctctgtggct tacgcaaaca tgtgtcgatg 2700 tctagtaacg ctgaaagtac ccatggcaga caagcctggt aacacagtga atttccggaa 2760 gctgctactg aaccgttgcc agaaggagtt tgaaaaagat aaagcagatg atgatgtctt 2820 tgagaagaag cagaaagaac ttgaggctgc cagtgctcca gaggagagga caaggcttca 2880 tgatgaactg gaagaagcca aggacaaagc ccggcggaga tccattggca acatcaagtt 2940 tattggagaa ctctttaaac tcaaaatgct gactgaagcc atcatgcatg actgtgtggt 3000 gaagctgcta aagaaccatg atgaagaatc cctggagtgc ctgtgtcgcc tgctcaccac 3060 cattggcaaa gacttggact ttgaaaaagc aaagccacgt atggaccagt actttaatca 3120 gatggagaaa attgtgaaag aaagaaaaac ctcatctagg attcggttca tgcttcaaga 3180 tgttatagac ctaaggctgt gcaattgggt atctcgaaga gcagatcaag ggcctaaaac 3240 tatcgaacag attcacaaag aggctaaaat agaagaacaa gaagagcaaa ggaaggtcca 3300 gcaactcatg accaaagaga agagaagacc aggtgtccag agagtggacg aaggtgggtg 3360 gaacactgta caaggggcca agaacagtcg ggtactggac ccctcaaaat tcctaaaaat 3420 cactaagcct acaattgatg aaaaaattca gctggtacct aaagcacagc taggcagctg 3480 gggaaaaggc agcagtggtg gagcaaaggc aagtgagact gatgccttac ggtcaagtgc 3540 ttccagttta aacagattct ctgccctgca acctccagca ccctcagggt ccacgccatc 3600 cacgcctgta gagtttgatt cccgaaggac cttaactagt cgtggaagta tgggcaggga 3660 gaagaatgac aagccccttc catctgcaac agctcggcca aatactttca tgaggggtgg 3720 cagcagtaaa gacctgctag acaatcagtc tcaagaagag cagcggagag agatgctgga 3780 gaccgtgaag cagctcacag gaggtgtgga tgtggagagg aacagcactg aggctgagcg 3840 aaataaaaca agggagtcag caaaaccaga aatttcagca atgtcagctc atgacaaggc 3900 tgcattatca gaagaggaac tggagaggaa gtcgaaatct atcattgatg aatttctaca 3960 cattaatgat tttaaggaag ccatgcagtg tgtggaagag ctgaatgccc agggcctact 4020 acatgttttt gtgagagtgg gagtggagtc caccctggaa aggagccaga tcaccaggga 4080 tcacatgggc caactactct atcagctggt acagtcagaa aaactcagca aacaggactt 4140 tttcaaaggt ttttcagaaa ctttggaatt ggcagatgac atggccattg atattcccca 4200 tatttggttg taccttgctg aactggtgac ccccatgtta aaagaaggtg gaatctccat 4260 gagagaactt accatagaat ttagcaaacc tttacttcct gttggaagag ctggggtctt 4320 gctatctgaa atattgcacc tactatgcaa acaaatgagc cataagaaag tgggagcctt 4380 atggagggag gctgacctca gctggaagga ctttttacca gaaggagaag atgtacataa 4440 ttttcttttg gagcagaagt tggacttcat agagtctgac agtccctgtt cctctgaagc 4500 actttcaaag aaagaactgt ctgccgaaga gctgtataag cgactcgaga aactcattat 4560 tgaggacaaa gcgaatgatg aacagatctt tgactgggta gaggctaatc tagacgaaat 4620 ccagatgagt tcacctacat tccttagagc tttaatgact gctgtttgta aagcagctat 4680 tatagccgac tcttctacct tcagagtgga cactgctgtt atcaagcaga gagtgccgat 4740 cttactcaag tacctagact cagatacaga gaaggaactg caagcacttt atgcactaca 4800 agcatcgata gtaaaacttg atcaacctgc caatttgctg cggatgtttt ttgattgtct 4860 atatgacgag gaggtgatct ccgaggatgc cttctacaaa tgggagagca gcaaggaccc 4920 tgcagagcag aatgggaagg gcgtggctct gaaatctgtc acggcattct tcacgtggct 4980 gcgggaagca gaagaggagt ctgaggataa ctaaaacttc aaatacacaa aatgaaacaa 5040 aagaaacaat ttaagtattt ttttaaaaag tttcacgtct tcgccaatca cagtgcagca 5100 aggccaattc tcgcagaaac cccacgtgtg cacgagtggg agaggggaaa gagaaaaaaa 5160 ggtgtacatg gaggaaaaag gtactggata aaagtaaact tcaaacctta gggcgggagc 5220 actaaaacca aaatacatgt attatttata gaaaatattt tctgttttaa tcttttcttt 5280 ttaaacaagg actcatactt aaaaaaatgt ttagcaaaaa aaaaaaaaag ttgagaactt 5340 ttaatttatt ttaaggactg caaatgccag tgtaattttt taatttgcag tttctgtaaa 5400 caactgtata atagaaaagc agagaaataa atttccctcc ccttcaagat gcacctcatg 5460 tttgttttaa ggtatagcat ttagtccaga tttgagaaag tttggggtga acaaggtaag 5520 aaagattttt ttttttttgg catcaaatct ttctgcctgc ctctcagctt gcttcagaaa 5580 atttaaaaaa tcacaatagt aatcaaaaca tacataacat tgaaacagaa ggaaatgctg 5640 tggaccacag aactccaaga attgtttaaa aaaaaaaaag tgctaccctg agaaaagtac 5700 tcttaatatc tcttgaaatc tttagagcaa ctttaaggct tgtaaataca tagaacaaat 5760 atttaaaaaa acaaaaagaa attgactcag tactatttct tttcactttg aaaatataaa 5820 gaacaaaata aagacaaaca ttgcaagttt 5850 278 2903 DNA Homo sapiens 278 ctcaattcgt caccaggagg aagacggagc tggctgccca gcccaaaggc ccatgagggg 60 atgcagttat gggctctgtc gccgtggatt gttattttgt gtcagtaagt aatccataaa 120 gtgccaacat gggaaagaaa cggacaaagg gaaaaactgt tccaatcgat gattcctctg 180 aaactttaga acctgtgtgc agacacatta gaaaaggatt ggaacaaggt aatttgaaaa 240 aggctttagt gaatgtggaa tggaatatct gccaagactg taagactgac aataaagtga 300 aagataaagc tgaagaagaa acagaagaaa agccttcagt ttggctgtgt cttaaatgtg 360 gccatcaggg ctgtggcaga aattctcagg agcagcatgc cttgaagcac tatctgacgc 420 caagatctga acctcactgt ctggttctta gtttggacaa ctggagtgta tggtgttacg 480 tatgtgataa tgaggtccag tattgtagtt caaaccagtt gggtcaagtg gttgattatg 540 tcagaaaaca agccagcatt acaactccaa agccagcaga gaaagataat ggaaatattg 600 aacttgaaaa taaaaaatta gaaaaagaga gtaagaatga acaagagaga gaaaagaagg 660 aaaacatggc taaagagaat cctcccatga attctccttg ccaaataacc gtgaaaggac 720 tcagtaattt gggaaacaca tgtttcttca atgcagttat gcagaacttg tcacaaacac 780 cagtgcttag agaactacta aaagaagtga aaatgtctgg aacaattgta aaaattgaac 840 cacctgattt ggcattaaca gaaccattag aaataaacct tgagcctcca ggccctctta 900 ctttagccat gagccagttt cttaatgaga tgcaagagac caaaaagggg gttgtgacac 960 cgaaagaact cttttctcag gtctgtaaaa aagcagtgcg gtttaaaggc tatcagcagc 1020 aagacagcca ggagctgctt cgctacttat tggatgggat gagagcagaa gaacaccaaa 1080 gagtgagtaa aggaatactt aaagcatttg gtaattctac tgaaaagttg gatgaagaac 1140 taaaaaataa agttaaagat tatgagaaga aaaaatcaat gccaagtttt gttgaccgca 1200 tctttggtgg tgaactaact agtatgatca tgtgtgatca atgcagaact gtctccttgg 1260 ttcatgaatc tttccttgat ttgtccctcc cagttttaga tgatcagagt ggtaagaaaa 1320 gtgtaaatga taaaaatctg aaaaagacag tggaggatga agatcaagat agtgaggaag 1380 aaaaagataa cgacagttac ataaaagaga gaagtgatat tccttctgga acaagtaagc 1440 acttacagaa aaaagcaaag aaacaagcca aaaagcaagc caagaaccaa cgaagacaac 1500 aaaaaattca aggaaaagtt cttcatttaa atgatatttg tactattgac catcctgaag 1560 acagtgaata tgaagctgaa atgtcacttc aaggagaagt aaatattaaa tccaaccata 1620 tttcacaaga gggtgttatg cataaagaat attgtgtcaa ccagaaagat ttgaatggcc 1680 aagcaaaaat gatcgaaagt gtaactgaca atcaaaaatc cacagaggaa gtagatatga 1740 aaaatatcaa catggataat gatctggagg ttttaacatc ttctcccact aggaatttaa 1800 atggtgccta cctaacggaa gggagcaatg gagaagtgga catttccaat ggtttcaaaa 1860 acctaaattt gaatgctgct cttcatcctg atgaaataaa tatagagatt ctgaatgata 1920 gtcatactcc tggaacaaag gtgtatgagg ttgtaaatga agatccagaa actgctttct 1980 gtactcttgc aaacagggaa gttttcaata ctgatgagtg ttcaatccaa cattgtttat 2040 atcagttcac ccgtaatgag aaacttcgag atgcgaataa actgctttgt gaagtatgca 2100 cacggagaca gtgtaatgga ccaaaggcaa atataaaagg tgaaaggaag catgtttaca 2160 ccaatgccaa aaagcagatg ctaatttctc ttgctcctcc tgttcttact cttcatttaa 2220 agagatttca gcaggctggt tttaacctac gcaaagttaa caaacacata aagtttccgg 2280 aaatcttaga tttggctcct ttttgcaccc ttaaatgtaa gaatgttgca gaagaaaata 2340 caagggtact ctattcctta tatggagttg ttgaacacag tggtactatg aggtcggggc 2400 attacactgc ctatgccaag gcaagaaccg caaatagtca tctctctaat cttgttcttc 2460 acggtgatat tccacaagat tttgaaatgg aatcaaaagg gcagtggttt cacatcagcg 2520 acacacatgt gcaagctgtg cctacaacta aagtactaaa ctcacaagcg tacctcctat 2580 tttatgagag aatactgtaa taatatcaaa agcacttttt ctggaaacac atttatggct 2640 tttataatgg ctgaaataac gataaaaaaa gactaattaa aatcatgttc acttaacatt 2700 aaatacatgc cagaagaaat catgtttatt taaatattga agggaaaaat acctaaaaat 2760 gtacaaaggt tttatattgt catagtggtt tttattcctg ctttgtttct ggaaaggaaa 2820 tcctgaatta cttaagtact ttgtgtttaa tatatctggg tgatggatca caacacatca 2880 ataaactgac ttaccctaaa atc 2903 279 873 DNA Homo sapiens 279 tagagagccc cggagccgcg gcgggagagg aacgcgcagc cagccttggg aagcccaggc 60 ccggcagcca tggcggtgga aggaggaatg aaatgtgtga agttcttgct ctacgtcctc 120 ctgctggcct tttgcgcctg tgcagtggga ctgattgccg tgggtgtcgg ggcacagctt 180 gtcctgagtc agaccataat ccagggggct acccctggct ctctgttgcc agtggtcatc 240 atcgcagtgg gtgtcttcct cttcctggtg gcttttgtgg gctgctgcgg ggcctgcaag 300 gagaactatt gtcttatgat cacgtttgcc atctttctgt ctcttatcat gttggtggag 360 gtggccgcag ccattgctgg ctatgtgttt agagataagg tgatgtcaga gtttaataac 420 aacttccggc agcagatgga gaattacccg aaaaacaacc acactgcttc gatcctggac 480 aggatgcagg cagattttaa gtgctgtggg gctgctaact acacagattg ggagaaaatc 540 ccttccatgt cgaagaaccg agtccccgac tcctgctgca ttaatgttac tgtgggctgt 600 gggattaatt tcaacgagaa ggcgatccat aaggagggct gtgtggagaa gattgggggc 660 tggctgagga aaaatgtgct ggtggtagct gcagcagccc ttggaattgc ttttgtcgag 720 gttttgggaa ttgtctttgc ctgctgcctc gtgaagagta tcagaagtgg ctacgaggtg 780 atgtaggggt ctggtctcct cagcctcctc atctggggga gtggaatagt atcctccagg 840 tttttcaatt aaacggatta ttttttcaga ccg 873 280 1721 DNA Homo sapiens 280 gaattccctg gctgcttgaa tctgttctgc cccctcccca cccatttcac caccaccatg 60 acaccgggca cccagtctcc tttcttcctg ctgctgctcc tcacagtgct tacagttgtt 120 acaggttctg gtcatgcaag ctctacccca ggtggagaaa aggagacttc ggctacccag 180 agaagttcag tgcccagctc tactgagaag aatgctgtga gtatgaccag cagcgtactc 240 tccagccaca gccccggttc aggctcctcc accactcagg gacaggatgt cactctggcc 300 ccggccacgg aaccagcttc aggttcagct gccacctggg gacaggatgt cacctcggtc 360 ccagtcacca ggccagccct gggctccacc accccgccag cccacgatgt cacctcagcc 420 ccggacaaca agccagcccc gggctccacc gcccccccag cccacggtgt cacctcggcc 480 ccggacacca ggccgccccc gggctccacc gcccccccag cccacggtgt cacctcggcc 540 ccggacacca ggccgccccc gggctccacc gcgcccgcag cccacggtgt cacctcggcc 600 ccggacacca ggccggcccc gggctccacc gcccccccag cccatggtgt cacctcggcc 660 ccggacaaca ggcccgcctt ggcgtccacc gcccctccag tccacaatgt cacctcggcc 720 tcaggctctg catcaggctc agcttctact ctggtgcaca acggcacctc tgccagggct 780 accacaaccc cagccagcaa gagcactcca ttctcaattc ccagccacca ctctgatact 840 cctaccaccc ttgccagcca tagcaccaag actgatgcca gtagcactca ccatagcacg 900 gtacctcctc tcacctcctc caatcacagc acttctcccc agttgtctac tggggtctct 960 ttctttttcc tgtcttttca catttcaaac ctccagttta attcctctct ggaagatccc 1020 agcaccgact actaccaaga gctgcagaga gacatttctg aaatgttttt gcagatttat 1080 aaacaagggg gttttctggg cctctccaat attaagttca ggccaggatc tgtggtggta 1140 caattgactc tggccttccg agaaggtacc atcaatgtcc acgacgtgga gacacagttc 1200 aatcagtata aaacggaagc agcctctcga tataacctga cgatctcaga cgtcagcgtg 1260 agtgatgtgc catttccttt ctctgcccag tctggggctg gggtgccagg ctggggcatc 1320 gcgctgctgg tgctggtctg tgttctggtt gcgctggcca ttgtctatct cattgccttg 1380 gctgtctgtc agtgccgccg aaagaactac gggcagctgg acatctttcc agcccgggat 1440 acctaccatc ctatgagcga gtaccccacc taccacaccc atgggcgcta tgtgccccct 1500 agcagtaccg atcgtagccc ctatgagaag gtttctgcag gtaatggtgg cagcagcctc 1560 tcttacacaa acccagcagt ggcagccact tctgccaact tgtaggggca cgtcgccctc 1620 tgagctgagt ggccagccag tgccattcca ctccactcag ggctctctgg gccagtcctc 1680 ctgggagccc ccaccacaac acttcccagg catggaattc c 1721 281 5988 DNA Homo sapiens 281 gggccttgcg ccgcgggagc ggacggcggc ggaggagacc ctaggctcgc ggcccgaggc 60 gggagggctc ggcttctcga ctgcccgcct ccgaggccgc cggccgcttc tctctcccag 120 agtggccgcc gcccctggag actcgccgtg acacgggcct aagccgccgc cgcgggagtc 180 ctgaccgctc ggacccgtcg gatcaggccg ggtgggagcg agcttgcggg caggtgccgc 240 tcccggaggg tgggccggag gcgaggcgcc caccgcgcgg ctcgcgggcc gggctcggcg 300 gaggggccgc tcgcgcagca cccccaccgc gggccggagc ccgggtcgcc gccccgcctt 360 ctcccgggac cgcccggccg gagctgcggg ggccgaggga cgccgcgccc gccgccgcca 420 gccgggctcg cgcgggagag cagggaagag aaactttgcc ttttattgtt tttagtcctt 480 aagtgcaagg aactctgtgt tgggaggaaa aatgtccttc ttcagtttcc gtaagatctt 540 caagttgggg agcgagaaga agaagaagca gtacgaacac gtgaagaggg acctgaaccc 600 cgaagacttt tgggagatta taggagaact gggcgacgga gcctttggga aagtgtacaa 660 ggcccagaat aaagagacca gtgttttagc tgctgcaaaa gtgattgaca ctaaatctga 720 agaagaactt gaagattaca tggtagagat tgacatatta gcatcttgtg atcacccaaa 780 tatagtcaag cttctagatg ccttctatta tgagaacaat ctttggatcc tcattgaatt 840 ttgtgcaggt ggagcagtag atgctgtgat gcttgaactt gagagaccat taactgagtc 900 ccaaatacaa gtagtttgca agcagacttt agatgcattg aactacttac atgataataa 960 gatcatccac agagatctga aggctggcaa cattctcttt accttagatg gagatatcaa 1020 attggcggat tttggagtat cagctaaaaa cacgaggaca attcaaagaa gagattcctt 1080 tattggtaca ccatattgga tggctcctga agtagtcatg tgtgaaacat ctaaggacag 1140 accctatgac tacaaagctg atgtttggtc cctgggtatc actttaatag aaatggctga 1200 gatagaacca cctcatcatg aattaaatcc aatgcgagtg ctgctaaaaa tagcaaaatc 1260 tgagccacct acattagcac agccatccag atggtcttca aattttaagg actttctaaa 1320 gaaatgctta gaaaagaatg tggatgccag gtggactaca tctcagctgc tgcagcatcc 1380 ctttgttact gttgattcca acaaacccat ccgagaattg attgcagagg cgaaggctga 1440 agtaacagaa gaagttgaag atggcaaaga ggaagatgaa gaggaggaaa cagaaaattc 1500 tctgccaata cctgcaagta agcgtgcatc ttctgacctt agtatcgcca gctctgaaga 1560 agataaactt tcacaaaatg cttgtatttt ggagtctgtc tcagaaaaaa cagaacgtag 1620 taactctgaa gataaactca acagcaaaat tcttaatgaa aaacccacca ctgatgaacc 1680 tgaaaaggct gtggaggata ttaatgaaca tattaccgat gctcagttag aagcaatgac 1740 tgaactccat gacagaacag cagtaatcaa ggagaatgaa agagagaaga ggcccaagct 1800 tgaaaatctg cctgacacag aagaccaaga aactgtggac attaattcag tcagtgaagg 1860 aaaagagaat aatataatga taaccttaga aacaaatatt gaacataatc taaaatctga 1920 ggaagaaaag gatcaggaaa agcaacagat gtttgaaaat aagcttataa aatctgaaga 1980 aattaaagat actattttgc aaacagtaga tttagtttct caagagactg gagaaaaaga 2040 ggcaaatatt caggcagttg atagtgaagt tgggcttaca aaggaagaca cccaagagaa 2100 attgggggaa gacgacaaaa ctcaaaaaga tgtgatcagc aatacaagtg atgtgatagg 2160 aacatgtgag gcagcagatg tggctcagaa agtggatgaa gacagtgctg aggatacgca 2220 gagtaatgat gggaaagaag tggtcgaagt aggccagaaa ttaattaata agcccatggt 2280 gggtcctgag gctggtggta ctaaggaagt tcctattaaa gaaatagttg aaatgaatga 2340 aatagaagaa ggtaaaaata aggaacaagc aataaacagt tcagagaaca taatggacat 2400 caatgaggaa ccaggaacaa ctgaaggtga agaaatcact gagtcaagta gcactgaaga 2460 aatggaggtc agaagtgtgg tggctgatac tgaccaaaag gctttaggaa gtgaagttca 2520 ggatgcttct aaagtcacta ctcagataga taaagagaaa aaagaaattc cagtgtcaat 2580 taaaaaagag cctgaagtta ctgtagtttc acagcccact gaacctcagc ctgttctaat 2640 acccagtatt aatatcaact ctgacagtgg agaaaataaa gaagaaatag gttctttatc 2700 aaaaactgaa actattctgc caccagaatc tgagaatcca aaggaaaatg ataatgattc 2760 aggcactggt tccactgctg atactagcag tattgacttg aatttatcca tctctagctt 2820 tctaagtaaa actaaagaca gtggatcgat atctttacaa gaaacaagaa gacaaaagaa 2880 aacattgaag aaaacacgca aatttattgt tgatggtgta gaagtgagtg taacaacatc 2940 aaagatagtt acagatagtg attccaaaac tgaagaattg cggtttctta gacgtcagga 3000 acttcgggaa ttaagatttc ttcagaaaga agagcaaaga gcccaacaac agctcaatag 3060 caaactacag caacaacgag aacaaatttt ccggcgcttt gagcaggaaa tgatgagtaa 3120 aaagcgacaa tatgaccagg aaattgagaa tctagaaaaa cagcagaaac agactatcga 3180 acgcctggaa caagagcaca caaatcgctt gcgagatgaa gccaaacgca tcaaaggaga 3240 acaagagaaa gagttgtcca aatttcagaa tatgctgaag aaccgaaaga aggaggaaca 3300 agagtttgtt cagaaacaac agcaagaatt agatggctct ctgaaaaaga tcatccagca 3360 gcagaaggca gagttagcta atattgagag agagtgcctg aataacaagc aacagctcat 3420 gagagctcga gaagctgcaa tttgggagct cgaagaacga cacttacaag aaaaacacca 3480 gctgctcaaa cagcagctta aagatcagta tttcatgcaa agacatcagc tacttaagcg 3540 ccacgagaag gaaacagagc aaatgcagcg ttacaatcaa agacttattg aggaattgaa 3600 aaacagacag actcaagaaa gagcaagact gcccaagatt cagcgcagtg aagccaagac 3660 tcgaatggcc atgtttaaga agagtttgag aattaactca acagccacac cagatcagga 3720 ccgtgataaa attaaacagt ttgctgcaca agaagaaaag aggcagaaaa atgagagaat 3780 ggctcagcat cagaaacatg agaatcaaat gcgagatctt cagttgcagt gtgaagccaa 3840 tgtccgcgaa ctgcatcagc tgcagaatga aaaatgccac ttgttggttg agcatgagac 3900 tcagaaactg aaggagttag atgaggaaca tacaagaatt aaaggagtgg agagagaaat 3960 tgagacctag gaaaaagaca ctggaagaag agtttgccag gaaactacag gaacaggaag 4020 tattctttaa aatgactggg gagtctgaat gccttaaccc atcaacacag agccggattt 4080 ccaaatttta tcctattccc agcttgcatt ccaccggatc ataacaaagg gaagcattct 4140 gtgcgtgggt ttggctcttt cagtatgtca ttctgttctc atcttctgcc acagtctctc 4200 agatagctca tgaagacaat cacctgcctc accttctagg tgttttcctt ttttgttttt 4260 tttgttttgt tttgttttta agcaaagatg aagggaaaac gaactaagac agacgctagg 4320 ccatgttggc aaagtagcat cttggtgact aaggtgactt tgtatattca tcttaaaaat 4380 tatgttcttt agacactgct acctgaaaac tgttggagaa ataatgttta aagttattta 4440 agaaaaactg ttacatcact aagtattaat aaattcttct tacctgacgt aacttctcaa 4500 tgcctaaatt ctgtagttga agctctgctg cagagagttg ggataatttt cttttggtgg 4560 atcagctctc ataaaaaagc tatgatttgc tcaaatatgc tgttgactca gtaaatgaat 4620 atattttttt ttttaaatag gaacaacctc ttttaaaaga gaaaaattat ttcagtgatt 4680 tgtcaaaacg aattacctct tttggcatga gctaataatt gagggtgcta attttcttaa 4740 gatagtgcct aaaacactaa atttcagtca agtcgtaagt aggattttct ttttgatcaa 4800 cagggacaaa aacatcttta gaattaaaaa catggttgtt ttggaatttt tgcttctctt 4860 accgtttgat agaaattttc atcctaaaat acatgtacaa agtttggaaa gatgaaaaaa 4920 agaggtagct tttagattgc aaattggaaa tgtaaaactc atgaaattta agcaatatag 4980 gtttagctat ctgtgtttat tttctaaaat aatacctgag ctggttaaat gatttctctc 5040 catcttagct aattctgttt aaaactctgt cagaggcctg caggctgtga gttatattta 5100 taaatatatc ttcagaaatt aatcttaaaa gaggcattag ttcagaatac ttttttaaaa 5160 gtttaaatta aatatttagg cacgtcagaa attacttttc cttattttga aatgaggcta 5220 cttatgtctt ggttttattt tgttccatgt ttaaatcatt cagtttgatt tgagtgggaa 5280 aagcctgaag cctttatcat gtggttgctg gtgtgtgtaa ttattaatga aatgttcact 5340 cctagtccct tatgaggctt agaatttcaa ccacgtgtca ggtcagacag tattataaac 5400 tgtactttgc tgtctgagac agcacatttg tgaatgatgc ttgctgcctg ccattttcaa 5460 cctattctct cttaagagtg ctaggtacca aattgtgaaa gtttgttttc agttatatta 5520 cttttgaggc tggtgaaaaa tttaaatgta actttgtggg aacactgatt catatttaga 5580 aaatgtaaat gtctgtagca ctttcttgca gttaatttga aaactttgga tgctgaacct 5640 tgtttgtcag tgatttagat gatttaaaaa tgcatgtgtg atttgaattt tataattgtt 5700 ttgacaagca taatttactt ggacaacttc gtaggtagcc ttaacttctg gccaagtttg 5760 ttttttatat aaatatatat acatatatac atattatgta tggttgtaaa ttcatacact 5820 tatcacatga atgtgttact gtatacaaaa ctcttaatgc tttattctca aatgctgggt 5880 tgaaaaatgt tttgaaagcc ttttaaaata tatatcttta taaagtaata ttcaggatga 5940 tgataaaaat tgtttatatt gttatgataa aaatgacagt ataatgtt 5988 282 1811 DNA Homo sapiens 282 gagcgcgcgg ggtggggggg gtcctggtct ttggcttctc gactcggtcc tgtttcgaca 60 gcgaacatgt cgcggcctgt cagaaatagg aaggttgttg attactcaca gtttcaggaa 120 tctgatgatg cagatgaaga ttatggaaga gattcgggcc ctcccactaa gaaaattcga 180 tcatctcccc gagaagctaa aaataagagg cgatctggaa agaattcaca ggaagatagt 240 gaggactcag aagacaaaga tgtgaagacc aagaaggatg attctcactc agcagaggat 300 agtgaagatg aaaaagaaga tcataaaaat gtgcgccaac aacggcaggc ggcatctaaa 360 gcagcttcta aacagagaga gatgctcatg gaagatgtgg gcagtgagga agaacaagaa 420 gaggaggatg aggcaccatt ccaggagaaa gattccggca gcgatgaaga tttcctaatg 480 gaagatgatg acgatagtga ctatggcagt tcgaaaaaga aaaacaaaaa gatggttaag 540 aagtccaaac ctgaatgaaa agaaaagaaa atgcccaaac ccagactaaa ggctacagtg 600 acgccaagtc cagtgaaagg caaagggaaa gtgggtcgcc ccacagcttc aaaggcatca 660 aaggaaaaga ctccttctcc caaagaagaa gatgaggaac cggaaagcct gccagaaaag 720 aaaacatcta caagcccccc acccgagaaa tctggggatg aagggtctga agatgaagcc 780 ccttctgggg aggattaaaa gtgatgatgg tctggggaga gattttatta aaaaaaaaaa 840 gaaaaaaaaa gaaaaaagag ggaggaaaaa aaaagaacct acttaagata gaacatggtt 900 ttggctatgg cttgactcat gggctttcag tgcttttttc catttgtcga aagtaacatt 960 tctctctctc tctctttttt tttttttttt ttttaaagca aaccattgta tgtgtaagtg 1020 tttaagttac ctttttgtct attggtctct ttgccagccc tcccctttcc caatgaaagc 1080 catgtcaaat taatcactgg attgactgct tcatcttttt atttttaatg aaaggtgtac 1140 cacggttgta aagcaataag atttgagatg aacactattg aaacttcgct ttttgctaaa 1200 aaatagcaag ttgaatagta atcaaaaaac atagaaagat tttagttcaa aatgattgct 1260 cctttctcta cctggacttt taaaaaatca attgtcatct aatatgagtt tatttgtcta 1320 tagacacaag tatcaatgtc taaaaaaaat catgacttta aacttccacc gatgaggcag 1380 gtaggagata aagatgaatt ctgaactgtt actaaaagta ctcatttttt accttgtagg 1440 gagggtgggc aatggggtta cccgacctta tttgagggta tgggctttct tttttatttc 1500 atcacttgtt atctcaaaga gactcggagc cagtgatcct tttatcctgc tacagtcttt 1560 agggagctaa aaaaaaaaaa aaagcagggg ctgccaaaac tcttgatttc atatttcctt 1620 ctctaaatat atatgtatcc tgttttttgg ataaaatttt accaagaatc caaaaaaaaa 1680 aaaaccctag aatttaatca acaagatcag tctacaggtc acagtggatt tcttttcaaa 1740 ctgacaatgt ttaggtttta agcaaataaa gttccagtta atgtgaaact cagaaaaaaa 1800 aaaaaaaaaa a 1811 283 943 DNA Homo sapiens 283 gcccgcagca cctcctcgcc agcagccgtc cggagccagc caacgagcgg aaaatggcag 60 acaatttttc gctccatgat gcgttatctg ggtctggaaa cccaaaccct caaggatggc 120 ctggcgcatg ggggaaccag cctgctgggg cagggggcta cccaggggct tcctatcctg 180 gggcctaccc cgggcaggca cccccagggg cttatcctgg acaggcacct ccaggcgcct 240 accctggagc acctggagct tatcccggag cacctgcacc tggagtctac ccagggccac 300 ccagcggccc tggggcctac ccatcttctg gacagccaag tgccaccgga gcctaccctg 360 ccactggccc ctatggcgcc cctgctgggc cactgattgt gccttataac ctgcctttgc 420 ctgggggagt ggtgcctcgc atgctgataa caattctggg cacggtgaag cccaatgcaa 480 acagaattgc tttagatttc caaagaggga atgatgttgc cttccacttt aacccacgct 540 tcaatgagaa caacaggaga gtcattgttt gcaatacaaa gctggataat aactggggaa 600 gggaagaaag acagtcggtt ttcccatttg aaagtgggaa accattcaaa atacaagtac 660 tggttgaacc tgaccacttc aaggttgcag tgaatgatgc tcacttgttg cagtacaatc 720 atcgggttaa aaaactcaat gaaatcagca aactgggaat ttctggtgac atagacctca 780 ccagtgcttc atataccatg atataatctg aaaggggcag attaaaaaaa aaaaaagaat 840 ctaaacctta catgtgtaaa ggtttcatgt tcactgtgag tgaaaatttt tacattcatc 900 aatatccctc ttgtaagtca tctacttaat aaatattaca gtg 943 284 1465 DNA Homo sapiens 284 tcagtcccag cagttccctt cgtgcgcggg gggcggcgag ggtcttcagc agtcgggaga 60 ggcccttgac ggcgccatgt cggcgggcgg tccatgccca gcagcagccg gagggggccc 120 agggggcgcc tcctgctccg tgggggcccc tggcggggta tccatgttcc ggtggctgga 180 ggtgctggag aaggagttcg acaaagcttt tgtggatgtg gatctgctcc tgggagagat 240 cgatccagac caagcggaca tcacttatga ggggcgacag aagatgacca gcctgagctc 300 ctgctttgca cagctttgcc acaaagccca gtctgtgtct caaatcaacc acaagctgga 360 ggcacagttg gtggatctga aatctgaact gacagaaacc caagcagaga aagttgtttt 420 ggagaaagaa gtacatgatc agcttttaca gctgcactct attcagctgc agcttcatgc 480 taaaactggt caaagtgctg actctggtac cattaaggca aaattgtctg gcccctctgt 540 ggaggagctg gaaagagagc ttgaggcaaa caaaaaagaa aaaatgaaag aagcacaact 600 tgaagctgaa gtgaaattgt tgagaaaaga gaatgaagcc cttcgtagac atatagctgt 660 tctccaggct gaagtatatg gggcgagact agctgccaag tacttggata aggaactggc 720 aggaagggtc caacagatac aattgctagg acgagatatg aagggacctg ctcatgataa 780 gctttggaac caattagaag ctgaaataca tttgcatcgt cacaaaactg tgatccgagc 840 ctgcagagga cgtaatgact tgaaacgacc aatgcaagca ccaccaggcc atgatcaaga 900 ttccctaaag aaaagccaag gtgttggtcc aattagaaaa gttctcctcc ttaaggaaga 960 tcatgaaggc cttggcattt caattacagg tgggaaagaa catggtgttc caatcctcat 1020 ctctgagatc catccggggc aacctgctga tagatgcgga gggctgcacg ttggggatgc 1080 tattttggca gtcaacggag ttaacctaag ggacacaaag cataaagaag ctgtaactat 1140 tctttctcag cagagaggag agattgaatt tgaagtagtt tatgtggctc ctgaagtgga 1200 ttctgatgat gaaaacgtag agtatgaaga tgagagtgga catcgttacc gtttgtacct 1260 tgatgagtta gaaggaggtg gtaaccctgg tgctagttgc aaagacacaa gtggggaaat 1320 caaagtatta caaggattta ataagaaggc agtaactgac acacatgaaa atggagacct 1380 gggcactgca agtgaaactc cgctagatga cggtgcttca aaattagatg atctgcacac 1440 tctgtatcat aaaaaatcgt attaa 1465 285 1709 DNA Homo sapiens 285 attttgaatg tgcagctgca gcgggcgtga gttgggggag gacgggttgc cgactcgcct 60 acctagcggt ctcttgattg tcgatatttt gttggcatag gtttatgtag agacgtatac 120 atatatatag acacactgtc tataaatcta ggcctgtatc cggtgtccga ggcgaactca 180 gtaagatgat gttaagagga aacctgaagc aagtgcgcat tgagaaaaac ccggcccgcc 240 ttcgcgccct ggagtccgcg gtgggcgaga gcgagccggc ggccgcggca gccatggcgc 300 tcgctcttgc cggggagccg gcaccgcccg cgcccgcgcc tccagaggac cacccggacg 360 aggagatggg gttcactatc gacatcaaga gtttcctcaa gccgggcgag aagacgtaca 420 cgcagcgctg ccgcctcttc gtgggaaatc tgcccaccga catcacggag gaggacttca 480 agaggctctt cgaacgctat ggcgagccca gcgaagtctt catcaaccgg gaccgtggct 540 tcggcttcat ccgcttggaa tccagaaccc tggctgaaat tgcaaaagca gagctggacg 600 gcaccattct caagagcaga cctctacgga ttcgcttcgc tacacatgga gcagccttga 660 ctgtcaagaa cctttctcca gttgtttcca atgagctgct agagcaagca ttttctcagt 720 ttggtccagt agagaaagct gttgtggttg tggatgatcg cggtagagct acaggaaaag 780 gttttgtaga gtttgcagca aaacctcctg cacgaaaggc tctggaaaga tgtggtgatg 840 gggcattctt gctaacaacg acccctcgtc cagtcattgt ggaacccatg gagcagtttg 900 atgatgaaga tggcttgcca gagaagctga tgcagaaaac tcaacaatat cataaggaaa 960 gagaacaacc accacgtttt gctcaacctg ggacatttga atttgagtat gcatctcgat 1020 ggaaggctct tgatgaaatg gaaaagcagc agcgtgagca ggttgataga aacatcagag 1080 aagccaaaga gaaactggag gcagaaatgg aagcagctag gcatgaacac caattaatgc 1140 taatgaggca agatctaatg aggcgtcaag aagaactcag acgcttggaa gaactcagaa 1200 accaagagtt gcaaaaacgg aagcaaatac aactaagaca tgaagaggag catcggcggc 1260 gtgaggaaga aatgatccga cacagagaac aggaggaact gaggcgacag caagagggct 1320 ttaagccaaa ctacatggaa aatggtgata aaagaaaatg tggctgaagt tacccgaagt 1380 tcagcttgca gtgtaattca gaagagttaa gaccagaagt actggaatca tacttccttt 1440 caggtcaaaa cattttgtgg gaaatatcct gttcgtgacc taggagaaac aagacagaaa 1500 acaaatcttt cttcatttat gaatttctta ctccatacaa gcagtttatg gatgtctggg 1560 caatcatagc acttgccatt taaaaacatg ctacaggggc acattttctg tggttaaaaa 1620 tgatctccag aataagtgta aactgcagtt ttcctttgca ttgagggttt tggcttttta 1680 tttgtaaatt aaatcaagat gttacccac 1709 286 1180 DNA Homo sapiens 286 caaatttgaa attatacaca cacatcagac aaaaaggaaa atcgggttct acatcagcaa 60 tctgtcttaa cttcagcatc acacaactac ctatgaagaa gacagaacaa gaattcagtt 120 cttctggtct gtacttctca gaaatgtata aacaattttt atcaacatta aaactgtcct 180 agcaaaacca aacaatcact atcttttcac acagctgaca ctttcccacc atttgccatc 240 tcatcgctgg agggcacaga aaaacgacac tcaaaatgct tgtgcttttg aggctctgct 300 atatacacta agtgaactac tggcttccag ttgggaatgg agggctgagg acaccctaat 360 tcagtgcttc cgcagctagc actcttcagc cttccctgcg ttccccttac ctgagctggg 420 ggatgaagag cagggtgaca gtgggagtaa gtacagaatt cctcaagcct tggtagcctt 480 agagaaatgc tctgggctta tactgaggct catctaccct aaaggccatc tctcaggaaa 540 acagagctgt tgtatcataa agcttttcag ggggaaaaaa ataaaaaatg ttcatcacaa 600 atctattgtg cttgttataa tctaacaaag tccactaaat gtagtaacga aatgtctccc 660 caataaaaaa ttaagattcc ctttggagct taggcatgtg gccttataaa gagcttaact 720 ggactgctac ctgttggtat gttcagggct tcattaggtt catgcttaag tataatgcct 780 gtgcctcaca gaacaattta aatttagctt aacaccccac ccatacttaa agagcaaata 840 aattatattt cgaaaagatt ttaaaataat ctcaatataa aaaccaacag tgggaactga 900 gtctggaata aatgtgtgca gaatcctaat tcttttctga gctccactca gtgaggatat 960 gcaccctcta ctgtccagcc tgtggtttgc agtaagctca ctcttctgtt cagaatgacc 1020 acaggagggc ctttacctag gaatagcagg aaaagctaag taagtactca aacttccctc 1080 ccagtaagca tgaattgctt cagatgtaaa gcttacatgt gatgagtaat actataatat 1140 catacattaa tatatcctaa taaactttaa ttttcaaaca 1180 287 2739 DNA Homo sapiens 287 cgcacgcgca gtcgtatccg tgtgatgggc gggctgttga cggcgctgcg atggctgcct 60 gcgagggcag gagaagcgga gctctcggtt cctctcagtc ggacttcctg acgccgccag 120 tgggcggggc cccttgggcc gtcgccacca ctgtagtcat gtacccaccg ccgccgccgc 180 cgcctcatcg ggacttcatc tcggtgacgc tgagctttgg cgagagctat gacaacagca 240 agagttggcg gcggcgctcg tgctggagga aatggaagca actgtcgaga ttgcagcgga 300 atatgattct cttcctcctt gcctttctgc ttttctgtgg actcctcttc tacatcaact 360 tggctgacca ttggaaagct ctggctttca ggctagagga agagcagaag atgaggccag 420 aaattgctgg gttaaaacca gcaaatccac ccgtcttacc agctcctcag aaggcggaca 480 ccgaccctga gaacttacct gagatttcgt cacagaagac acaaagacac atccagcggg 540 gaccacctca cctgcagatt agacccccaa gccaagacct gaaggatggg acccaggagg 600 aggccacaaa aaggcaagaa gcccctgtgg atccccgccc ggaaggagat ccgcagagga 660 cagtcatcag ctggagggga gcggtgatcg agcctgagca gggcaccgag ctcccttcaa 720 gaagagcaga agtgcccacc aagcctcccc tgccaccggc caggacacag ggcacaccag 780 tgcatctgaa ctatcgccag aagggcgtga ttgacgtctt cctgcatgca tggaaaggat 840 accgcaagtt tgcatggggc catgacgagc tgaagcctgt gtccaggtcc ttcagtgagt 900 ggtttggcct cggtctcaca ctgatcgacg cgctggacac catgtggatc ttgggtctga 960 ggaaagaatt tgaggaagcc aggaagtggg tgtcgaagaa gttacacttt gaaaaggacg 1020 tggacgtcaa cctgtttgag agcacgatcc gcatcctggg ggggctcctg agtgcctacc 1080 acctgtctgg ggacagcctc ttcctgagga aagctgagga ttttggaaat cggctaatgc 1140 ctgccttcag aacaccatcc aagattcctt actcggatgt gaacatcggt actggagttg 1200 cccacccgcc acggtggacc tccgacagca ctgtggccga ggtgaccagc attcagctgg 1260 agttccggga gctctcccgt ctcacagggg ataagaagtt tcaggaggca gtggagaagg 1320 tgacacagca catccacggc ctgtctggga agaaggatgg gctggtgccc atgttcatca 1380 atacccacag tggcctcttc acccacctgg gcgtattcac gctgggcgcc agggccgaca 1440 gctactatga gtacctgctg aagcagtgga tccagggcgg gaagcaggag acacagctgc 1500 tggaagacta cgtggaagcc atcgagggtg tcagaacgca cctgctgcgg cactccgagc 1560 ccagtaagct cacctttgtg ggggagcttg cccacggccg cttcagtgcc aagatggacc 1620 acctggtgtg cttcctgcca gggacgctgg ctctgggcgt ctaccacggc ctgcccgcca 1680 gccacatgga gctggcccag gagctcatgg agacttgtta ccagatgaac cggcagatgg 1740 agacggggct gagtcccgag atcgtgcact tcaaccttta cccccagccg ggccgtcggg 1800 acgtggaggt caagccagca gacaggcaca acctgctgcg gccagagacc gtggagagcc 1860 tgttctacct gtaccgcgtc acaggggacc gcaaatacca ggactggggc tgggagattc 1920 tgcagagctt cagccgattc acacgggtcc cctcgggtgg ctattcttcc atcaacaatg 1980 tccaggatcc tcagaagccc gagcctaggg acaagatgga gagcttcttc ctgggggaga 2040 cgctcaagta tctgttcttg ctcttctccg atgacccaaa cctgctcagc ctggacgcct 2100 acgtgttcaa caccgaagcc caccctctgc ctatctggac ccctgcctag ggtggatggc 2160 tgctggtgtg gggacttcgg gtgggcagag gcaccttgct gggtctgtgg cattttccaa 2220 ggcccacgta gcaccggcaa ccgccaagtg gcccaggctc tgaactggct ctgggctcct 2280 cctcgtctct gctttaatca ggacaccgtg aggacaagtg aggccgtcag tcttggtgtg 2340 atgcggggtg ggctgggccg ctggagcctc cgcctgcttc ctccagaaga cacgaatcat 2400 gactcacgat tgctgaagcc tgagcaggtc tctgtgggcc gaccagaggg gggcttcgag 2460 gtggtccctg gtactggggt gaccgagtgg acagcccagg gtgcagctct gcccgggctc 2520 gtgaagcctc agrtgtcccc aatccaaggg tctggagggg ctgccgtgac tccagaggcc 2580 tgaggctcca gggctggctc tggtgtttac aagctggact cagggatcct cctggccgcc 2640 ccgcaggggg cttggagggc tggacggcaa gtccgtctag ctcacgggcc cctccagtgg 2700 aatgggtctt ttcggtggag ataaaagttg atttgctct 2739 288 4623 DNA Homo sapiens 288 gtcacgcccc gggcagcttg gctggggcta ggcttccggg gctctgcggt cctcggcctg 60 tgctggcagc ctcggagccc accgagccgg gcggctggga tgatgaaccg gacgaccccc 120 gaccaggagc tggtgccggc gtcggagccc gtgtgggagc ggccgtggtc ggtggaggag 180 atccgcagga gcagccagag ctggtcgctg gcggccgacg cgggcctact acagtttcta 240 caggaattct cacagcaaac tatctctagg acccatgaaa tcaagaaaca agtggacgga 300 ctaatccggg aaaccaaagc cacagattgt cgcctgcata atgtcttcaa tgacttcctt 360 atgctctcta atacccagtt catagagaat cgtgtatatg atgaagaagt ggaggagcca 420 gtactcaagg ctgaggcaga aaaaacagag caggagaaga cacgagagca gaaagaagta 480 gatctcattc ctaaagtcca ggaggctgtg aactatggct tacaagtatt ggacagtgcc 540 tttgagcaac ttgatatcaa agcaggaaac tcagactccg aggaagatga tgctaatggg 600 cgggtggaac tgatccttga accaaaggat ctatacattg atcgtccttt accatatctc 660 attgggtcaa agctgttcat ggaacaagaa gatgtaggtc ttggagagct gtccagtgaa 720 gaaggctctg taggcagtga tcgtggcagt attgtggaca ctgaggaaga gaaagaagag 780 gaggagtcag atgaagattt tgcccatcac agtgacaatg aacaaaacca gcacaccaca 840 caaatgagtg atgaggaaga ggatgatgat ggctgtgacc tttttgctga ctctgagaag 900 gaggaggaag atattgagga cattgaagaa aatactagac ctaaaagaag cagacctaca 960 tcgtttgcag atgagctggc tgcccgcatc aagggggatg ccatgggtcg agtggacgag 1020 gagccgacaa ccttaccctc aggagaagca aaacctcgga agacactcaa agagaagaag 1080 gaaaggagaa ctccttcaga cgatgaagag gataacttat tcgcaccccc caagctgacc 1140 gacgaggact tctcgccatt tggctctgga ggtggcctgt tcagtggcgg caaggggcta 1200 tttgatgatg aggacgagga gagtgacctc ttcacggaag cctcccagga tcggcaagct 1260 ggagcctctg ttaaggagga gtcttcatca tccaaacctg gaaagaaaat cccagcagga 1320 gctgtttctg tatttttagg agacacggat gtgtttggtg ctgcctccgt tccatcactg 1380 aaggagccac agaagcctga gcagcccact ccaaggaaaa gcccctatgg tccccctccc 1440 actggcctct ttgatgatga tgatggtgat gatgatgacg actttttctc ggcaccccac 1500 agcaaacctt ctaaaacacg caaagtccaa tccactgccg atatctttgg tgacgaagaa 1560 ggagatctgt tcaaagaaaa agccgtagca tcgccagaag ccactgtgag tcagacagat 1620 gaaaataaag caagagcaga aaaaaaggtt accttatctt ccagcaagaa tctcaagccc 1680 tcatcagaaa caaagactca aaaaggctta ttttcagatg aggaggactc tgaggatttg 1740 ttttcttctc aaagtgcgag taacttaaaa ggtgcatctc tgctgcctgg caagctcccc 1800 acgtcggttt ccctgtttga tgatgaagat gaagaggata atctttttgg gggtacagct 1860 gctaagaagc agacattgtc tctacaagct cagagagaag agaaagcaaa agcctccgag 1920 ctctccaaaa agaaagcatc tgccctgttg ttcagcagtg atgaggagga ccagtggaat 1980 attcctgctt cacagaccca cttagcatct gacagcaggt ctaaaggaga acccagggat 2040 tctgggaccc tccagagcca ggaggccaag gctgtgaaaa agaccagtct ctttgaggaa 2100 gacaaagaag atgatctttt tgccattgcc aaggacagcc aaaagaagac ccagagagtg 2160 tcactcctct ttgaagacga tgttgatagc ggaggctctc tgtttggctc tcctcccaca 2220 tctgttcctc ctgcaacaaa gaaaaaagag actgtctctg aggcaccacc tttgctgttc 2280 agcgatgaag aagagaagga ggcacaactt ggagtgaagt ctgtggataa gaaggttgag 2340 agtgccaagg agtcattaaa atttgggaga actgatgtgg ctgagtcaga aaaggaagga 2400 cttttgacta gatctgctca ggaaacagtc aagcattctg atttattttc ttcatcatcc 2460 ccatgggaca aaggaaccaa gcctagaacc aaaactgttc ttagcttgtt tgatgaggaa 2520 gaggataaaa tggaagatca aaacattatc caggctccac agaaagaagt aggaaagggc 2580 tgcgatcctg atgcccaccc caagagcaca ggtgtcttcc aggatgaaga gctgcttttc 2640 agccacaagc tccaaaagga caatgaccca gatgttgacc tttttgctgg caccaaaaaa 2700 accaagctgt tagagccaag tgttgggagc ctgtttgggg atgatgaaga tgatgatctt 2760 ttcagctctg ccaagtccca gcctttggta caagagaaaa agagagtagt gaaaaaagac 2820 cactctgtta actctttcaa aaaccagaaa catcctgaat ccattcaagg tagtaaagaa 2880 aaaggcatat ggaagccgga aacacctcag gcaaatttag cgatcaaccc agcggccttg 2940 ctgcccacag cggcttccca gatctctgaa gtaaagcctg ttttgccaga attggctttt 3000 ccttcatctg aacacagaag gagccacggt ctggaaagtg tgcctgtcct tcccgggagt 3060 ggggaggccg gtgtgagttt tgatcttcca gctcaggcag acaccttaca cagtgcaaac 3120 aagagccgtg tcaagatgag agggaagcgt agaccgcaga cccgtgcagc taggcggctg 3180 gctgctcagg agtccagcga ggctgaggac atgagcatcc ccagaggacc cattgcacag 3240 tgggctgatg gcgccatttc cccaaatggc catcggccac agctcagagc agccagtgga 3300 gaagacagca ctgaggaggc cctggcagct gccgctgcac cttgggaagg tggtcctgtg 3360 cctggagtgg acacaagccc ctttgcaaag tctctgggtc attccagagg ggaggctgac 3420 ctttttgatt ctggggacat tttttccacg ggcactggat ctcagtccgt ggagagaaca 3480 aaacccaagg caaagatagc agagaatcct gccaacccac cagtgggtgg taaagcaaag 3540 agccccatgt ttcctgctct aggcgaggcc agcagtgatg atgatctctt tcagtctgct 3600 aaaccaaaac cagcaaagaa aacaaatccc tttcctctcc tggaagatga ggatgacctc 3660 tttacagatc agaaagtcaa gaagaatgag acaaaatcca gtagtcagca ggatgtcata 3720 ttaacaacac aagatatttt tgaggatgat atatttgcta cggaagcaat taaaccctct 3780 cagaaaacca gagagaagga gaaaacattg gaatctaatt tatttgatga taacattgat 3840 atctttgctg acttaactgt aaaaccaaaa gaaaagtcca aaaagaaagt ggaagccaag 3900 tctatatttg atgatgatat ggatgacatc ttctcctctg gtatccaggc taagacaacc 3960 aaaccaaaaa gccgatctgc acaggccgca cctgaaccaa gatttgaaca caaggtgtcc 4020 aacatctttg atgatcccct gaatgccttt ggaggccagt agagcacaca gggtatccac 4080 atgttaccct gcagctacat tgttgagtta gtgatgatgt tgtatatgct gatggtctta 4140 actggattac aaaaagcaaa tactagaaca gctagctcat cgttcaccca atgtacttgg 4200 tatttttctg cactggttta atcatgctta atactacaaa acaaaaataa atatttcaca 4260 gtggttggtt tgttttgttt ttaaaccaca gtttgattta gttagccttg ctggggccat 4320 aatatgcttc agggtgtgta aaagaagaaa tctctttgtg gctttcatgg gcagggaatc 4380 tcagagatag caaatgccac ctgaccagaa gtctttgtta tatggatggg aaccctaact 4440 tagggcttgg gcaggggaaa gagaaagaag atgagagatt atacttcatg agtcttagca 4500 atatgggagc aggttttcac tgaattctga gggtgcctct gcatgtcctc caaggcaaag 4560 tttggcaaac tgtggccccc ccactgtcat attttgttaa taaaatttta ttggaacaca 4620 atc 4623 289 3118 DNA Homo sapiens 289 gcagcctgag gaaaaaaaga gaaaaagata aaaaaaatct gaaaacgctt caaaatcctg 60 aaaaaaaaaa ggaaaagaaa aaacgaatcc tcggagaacc cgcggggaag tcactttcgt 120 acgcttccgg cctgccccgc gcccgccgcc gcagcgcttg gcgtccgtcg gtctccgtcc 180 gtcggtccgg gggtgagccg cccgcccggc ccgccgtgcc ctccccccgc tcgggccccg 240 agccccgcgc cccggcctgc cccggcgcac cacgtgtccg tgctgccctt cgccgcccgc 300 ccggggtcgc cgagtcggcg cccacaaaga tttggtttcc ctctgcccgg cggttgtaat 360 cttaaaccgc cggagcccga ggcctatatt tatagagaaa cgcgtgtccc cgaggcccgc 420 cgtgggcagc gtccggtcgc ctcttaaagg atttttaccc ttcggaaggg gattccccgt 480 ttaatttttt tcctactttg attttttgaa atttggagcc ttcgcaccag gaccgcggag 540 aagtgcaaag tcgcggggag ggccgtattg tgcggagagc cttttgtctg cggtgctgcg 600 gccgtgggag ccggcccccg cctcccgttt ccgtcccgtc tccaagcccg ccgactccag 660 ctcgtcctcg ccgcgccggt gccacctgtg agccggcgcg ggcccgggct ccggctccga 720 aggcgcccct ttgtcctgcg gcgggcccga taagaagtcc tctggcgggg ctcggggtgg 780 tggggggcgg ggagatgaac gctgcggcca gcagctaccc catggcctcc ctgtacgtgg 840 gcgacctgca ttcggacgtc accgaggcca tgctgtacga aaagttcagc cccgcggggc 900 ctgtgctgtc catccgggtc tgccgcgata tgatcacccg ccgctccctg ggctatgcct 960 acgtcaactt ccagcagccg gccgacgctg agcgggcttt ggacaccatg aactttgatg 1020 tgattaaggg aaagccaatc cgcatcatgt ggtctcagag ggatccctct ttgagaaaat 1080 ctggtgtggg aaacgtcttc atcaagaacc tggacaaatc tatagataac aaggcacttt 1140 atgatacttt ttctgctttt ggaaacatac tgtcctgcaa ggtggtgtgt gatgagaacg 1200 gctctaaggg ttatgccttt gtccacttcg agacccaaga ggctgccgac aaggccatcg 1260 agaagatgaa tggcatgctc ctcaatgacc gcaaagtatt tgtgggcaga ttcaagtctc 1320 gcaaagagcg ggaagctgag cttggagcca aagccaagga attcaccaat gtttatatca 1380 aaaactttgg ggaagaggtg gatgatgaga gtctgaaaga gctattcagt cagtttggta 1440 agaccctaag tgtcaaggtg atgagagatc ccaatgggaa atccaaaggc tttggctttg 1500 tgagttacga aaaacacgag gatgccaata aggctgtgga agagatgaat ggaaaagaaa 1560 taagtggtaa aatcatattt gtaggccgtg cacaaaagaa agtagaacgg caggcagagt 1620 taaaacggaa atttgaacag ttgaaacagg agagaattag tcgatatcag ggggtgaatc 1680 tctacattaa gaacttggat gacactattg atgatgagaa attaaggaaa gaattttctc 1740 cttttggatc aattaccagt gctaaggtaa tgctggagga tggaagaagc aaagggtttg 1800 gcttcgtctg cttctcatct cctgaagaag caaccaaagc agtcactgag atgaatggac 1860 gcattgtggg ctccaagcca ctatatgttg ccctggccca gaggaaggaa gagagaaagg 1920 ctcacctgac caaccagtat atgcaacgag tggctggaat gagagcactt cctgccaatg 1980 ccatcttaaa tcagttccag cctgcagcgg gtggctactt tgtgccagca gtcccacagg 2040 ctcagggaag gcctccatat tatacaccta accagttagc acagatgagg cctaatccac 2100 gctggcagca aggtgggaga cctcaaggct tccaaggaat gccaagtgct atacgccagt 2160 ctgggcctcg tccaactctt cgccatctgg ctccaactgg gtctgagtgc ccggaccgct 2220 tggctatgga ctttggtggg gctggtgccg cccagcaagg gctgactgac agctgccagt 2280 ctggaggcgt tcccacagct gtgcagaact tagcgccacg cgctgctgtt gctgctgctg 2340 ctccccgggc tgttgccccc tacaaatacg cctccagtgt ccgcagccct catcctgcca 2400 tacagcctct gcaggcaccc cagcctgcgg tccatgtgca ggggcaggag ccactgactg 2460 cctccatgct ggctgcagca cccccccagg aacagaagca gatgctggga gaacgcttgt 2520 tcccactcat ccaaacaatg cattcaaatc tggctgggaa gatcacggga atgctgctgg 2580 agatagacaa ctctgagctg ctgcacatgt tagagtcccc cgagtctctc cgctccaagg 2640 tggatgaagc tgtagcagtt ctacaggctc atcatgccaa gaaagaagct gcccagaagg 2700 tgggcgctgt tgctgctgct acctcttaga caaggaaaaa ccgattcaaa agccaaataa 2760 ccccttatgg aattcaactc aaggtttgaa gacttcctag cttgtcctat ggacctcaac 2820 accaaggatt acaaattgca aatttaatag gtcattttgt atcaaaaggt caattatgaa 2880 gcacctagaa tttttcaatt atacgaatat gttctttggg ttctgctgtg gcccagacag 2940 tgttaacttt ttttttattg tgggttttga ttttttcccc cagaaattgg ttttatttga 3000 tgtacccaag tcttacgttt cccaataaag aaaaaaaatc tccataaaaa aagtcgagcg 3060 gccgcgaatt ccagctgagc gccggtcgct accattaccg ttggtcttgg tgtcaaaa 3118 290 2615 DNA Homo sapiens 290 agtcccgtct caaagaaaat gaacggcacc ctggaccacc cagaccaacc agatcttgat 60 gctatcaaga tgtttgtggg ccaggttcca aggacctggt ctgaaaagga cttgcgggaa 120 ctcttcgaac agtatggtgc tgtgtatgaa atcaacgtcc taagggatag gagccaaaac 180 ccgcctcaga gcaaagggtg ctgttttgtt acattttaca cccgtaaagc tgcattagaa 240 gctcagaatg ctcttcacaa catgaaagtc ctcccaggga tgcatcaccc tatacagatg 300 aaacctgctg acagtgagaa gaacaatgca gtggaagaca ggaagctgtt tattggtatg 360 atttccaaga agtgcactga aaatgacatc cgagtcatgt tctcttcgtt tggacagatt 420 gaagaatgcc ggatattgcg gggacctgat ggcctgagcc gaggttgtgc atttgtgact 480 tttacaacaa gagccatggc acagacggct atcaaggcaa tgcaccaagc acagaccatg 540 gagggttgct catcacccat ggtggtaaaa tttgctgata cacagaagga caaagaacag 600 aagagaatgg cccagcagct ccagcagcag atgcagcaaa tcagcgcagc atctgtgtgg 660 ggaaaccttg ctggtctaaa tactcttgga ccccagtatt tagcacttta tttgcagctc 720 cttcagcaga ctgcctcctc tgggaacctc aacaccctga gcagcctcca cccaatggga 780 gggttgaatg caatgcagtt acagaatttg gctgcactag ctgctgcagc tagtgcagct 840 cagaacacac caagtggtac caatgctctc actacatcca gcagtcccct cagcgtgctc 900 actagttcag ggtcctcacc tagctctagc agcagtaatt ctgtcaaccc catagcctca 960 cttggagccc tgcagacatt agctggagca acggctggcc tcaatgttgg ctctttggca 1020 ggaatggctg ctttaaatgg tggcctgggc agcagtggcc tttccaatgg caccgggagc 1080 accatggagg ccctcactca ggcctactcg ggtatccagc aatatgctgc tgctgcgctc 1140 cccactctgt acaaccagaa tcttctgaca cagcagagta ttggtgctgc tggaagccag 1200 aaggaaggtc cagagggagc caacctgttc atctaccacc tgccccagga gtttggtgat 1260 caggacctgc tgcagatgtt tatgcccttt gggaatgtcg tgtctgccaa ggttttcata 1320 gacaagcaga caaacctgag caagtgtttt ggttttgtaa gttacgacaa tcctgtttcg 1380 gcccaagctg ccatccagtc catgaacggc tttcagattg gcatgaagcg gcttaaagtg 1440 cagctcaaac gttcgaagaa tgacagcaag ccctactgag cgtgctcccc tctgagactg 1500 gagtgagagg gtcttctggt aagtggggga ggagcaccct taatgattcg aagccctgag 1560 gctgtgtgtt gacagccctg gaccctgatc cctgccactc tcgcaggcac agcttgccct 1620 gaagactcgg ctactgcctt ctgtgggagt ttcgcttcgt acagaggaca agtttgtgct 1680 ttggtttcca gtgttttact ttggagttta ggtgccatat cctgaggttt tttttgtttt 1740 tgtttttgtt tctcctttat ttaaagtttg ctgtgtttgt aaccagtgtg tgttgagaag 1800 gaccaacacc aaaccaccct ggggaagggg gacagggaaa catttaacac agacgatgat 1860 caggatttgc cccaaaccac cccaagagag aggactgagt ggaacaaaaa agtgaccccc 1920 agaacctctc ttagtgtgag ggtgggtaga atgagaactg acacctggga gctgtgggga 1980 gcagagcggc tttggggaag agtggagtga ctagacacct aatgccctgg cagctggaga 2040 ctcaaactcc tgtacagcta ccttctggga aatagttttt gacacctatt tttcagattc 2100 ttgtccggaa tttctgctgg ctcttttcaa aaaagggcat taacaattct ctggaaataa 2160 agcacctgtt agcctgacat atgcaaaaag cagggcggca tcacccatta cgagctcccc 2220 cagccagcag tcagtattgg attggccttg cctggctggt ggcagtttgg gagaaacagc 2280 caaagaggtt agtttatttc aacaccaaat tagaccatgg cccatttcca acgggtctct 2340 ttaaaggcct ctatggaata cattgcctgg tttccttctt tgcagttcac cacagcaagg 2400 aatgtcacct ccatcccaag ccaccatttt ctcatgaagg caaatccaag aagggcttgc 2460 agttcttgct gaagggggta ccatttgtgg gcagagtgat caataccatc tagttggggg 2520 aggaggagct tatttcttgg tgtacttgaa tcagaaggtc cctgcaagcc agtatcgctt 2580 ttcttaccta atcacagagt ttgtgtagtg aattt 2615 291 2302 DNA Homo sapiens 291 gggcgagcgg gactggctgg gtcggctggg ctgctggtgc gaggagccgc ggggctgtgc 60 tcggcggcca aggggacagc gcgtgggtgg ccgaggatgc tgcggggcgg tagctccggc 120 gcccctagct ggtgactgct gcgccgtgcc tcacacagcc gaggcgggct cggcgcacag 180 tcgctgctcc gcgcgcgcgc ccggcggcgc tccaggtgct gacagcgcga gagagcgcgg 240 ccctcaggag caaggcgaat gtatgacacc atgtccacaa tggtgtacat aaaggaagac 300 aagttggaga agcttacaca ggatgaaatt atttctaaga caaagcaagt aattcagggg 360 ctggaagctt tgaagaatga gcacaattcc attttacaaa gtttgctgga gacactgaag 420 tgtttgaaga aagatgatga aagtaatttg gtggaggaga aatcaaacat gatccggaag 480 tcactggaga tgttggagct cggcctgagt gaggcacagg ttatgatggc tttgtcaaat 540 cacctgaatg ctgtggagtc cgagaagcag aaactgcgtg cgcaggttcg tcgtctgtgc 600 caggagaatc agtggctacg ggatgaactg gccaacacgc agcagaaact gcagaagagt 660 gagcagtctg tggctcaact ggaggaggag aagaagcatc tggagtttat gaatcagcta 720 aaaaaatatg atgacgacat ttccccatcc gaggacaaag acactgattc taccaaagag 780 cctctggatg accttttccc caatgatgaa gacgacccag ggcaaggaat ccagcagcag 840 cacagcagtg cagccgcggc tgcccagcag ggcggctacg agatccccgc gcggctgcgg 900 acgctccaca acctggtgat ccagtacgcc tcgcaggggc gctacgaggt agctgtgccc 960 ctctgcaagc aggccctgga ggacctggag aagacttcag gacacgacca cccggacgtg 1020 gccaccatgc tcaacatcct ggccttggtg tacagggatc agaataaata caaagatgca 1080 gctaacctac tgaatgatgc cttggctatt cgtgagaaaa ctttgggcaa agatcatcct 1140 gcggtggcgg cgactttgaa taaccttgca gtcctttatg gtaaaagagg gaagtacaaa 1200 gaagcagagc cgttgtgtaa aagagctctg gaaatccgag aaaaggtttt ggggaaggat 1260 caccccgatg ttgccaagca gttaaataac ttggccttac tgtgccagaa ccagggcaag 1320 tatgaagaag tagaatatta ttatcaaaga gccctcgaga tctaccagac aaaactggga 1380 cctgatgacc ccaacgtggc taagacgaaa aataacctgg catcctgcta tttgaaacaa 1440 ggaaagttca agcaagcaga aacactgtac aaagagattc tcactcgtgc acatgaaagg 1500 gagtttggtt ctgtagatga tgaaaataaa cccatctgga tgcatgctga agaaagagaa 1560 gaatgcaaag gaaagcaaaa ggatgggaca tcttttggag agtatggcgg ctggtacaaa 1620 gcctgcaaag ttgatagtcc aactgttaca accactctaa aaaaccttgg ggcactttac 1680 agacgtcaag gcaaatttga agctgcagaa acgttagaag aagctgctat gaggtctcgt 1740 aaacagggtc ttgacaatgt tcacaaacag agggtggcag aagtgctcaa tgaccctgag 1800 aacatggaga agcgcaggag ccgtgagagc ctcaacgtgg acgtggtcaa gtacgagagt 1860 ggccctgacg gaggggagga agtgagtatg agcgtagagt ggaacggggg cgtctctggc 1920 cgagcctctt tttgtggaaa acgacagcag cagcagtggc ctggaagacg ccaccgctaa 1980 ctgaccccga cctggccccg ctccaggatg ggactgccga gtgtggcccg gagctggccc 2040 gggacagcca gggcggcagg gaggcccctg gccgggagcg cagcgctcac tcatttctcc 2100 tgcgtctgtg tgcataggac atgatactaa taaccacacg gctggcgtga ccttggggct 2160 ggggctgggc ctaagctggt gccctggtgc ggcgtggtct ctcccaggag acctggggca 2220 tgagctgggc ccacggctcc cttcccatgt gtaacttcct cacgttgtgt gcgataacgt 2280 attttattgt acacccgaat tc 2302 292 10190 DNA Homo sapiens 292 gagaggtcgt tttcccgtcc ccgagagcaa gtttatttac aaatgttgga gtaataaaga 60 aggcagaaca aaatgagctg ggctttggaa gaatggaaag aagggctgcc tacaagagct 120 cttcagaaaa ttcaagagct tgaaggacag cttgacaaac tgaagaagga aaagcagcaa 180 aggcagtttc agcttgacag tctcgaggct gcgcctcaga agcaaacaca gaaggttgaa 240 aatgaaaaaa ccgagggtac aaacctgaaa agggagaatc aaagattgat ggaaatatgt 300 gaaagtctgg agaaaactaa gcagaagatt tctcatgaac ttcaagtcaa ggagtcacaa 360 gtgaatttcc aggaaggaca actgaattca ggcaaaaaac aaatagaaaa actggaacag 420 gaacttaaaa ggtgtaaatc tgagcttgaa agaagccaac aagctgcgca gtctgcagat 480 gtctctctga atccatgcaa tacaccacaa aaaattttta caactccact aacaccaagt 540 caatattata gtggttccaa gtatgaagat ctaaaagaaa aatataataa agaggttgaa 600 gaacgaaaaa gattagaggc agaggttaaa gccttgcagg ctaaaaaagc aagccagact 660 cttccacaag ccaccatgaa tcaccgcgac attgcccggc atcaggcttc atcatctgtg 720 ttctcatggc agcaagagaa gaccccaagt catctttcat ctaattctca aagaactcca 780 attaggagag atttctctgc atcttacttt tctggggaac aagaggtgac tccaagtcga 840 tcaactttgc aaatagggaa aagagatgct aatagcagtt tctttgacaa ttctagcagt 900 cctcatcttt tggatcaatt aaaagcgcag aatcaagagc taagaaacaa gattaatgag 960 ttggaactac gcctgcaagg acatgaaaaa gaaatgaaag gccaagtgaa taagtttcaa 1020 gaactccaac tccaactgga gaaagcaaaa gtggaattaa ttgaaaaaga gaaagttttg 1080 aacaaatgta gggatgaact agtgagaaca acagcacaat acgaccaggc gtcaaccaag 1140 tatactgcat tggaacaaaa actgaaaaaa ttgacggaag atttgagttg tcagcgacaa 1200 aatgcagaaa gtgccagatg ttctctggaa cagaaaatta aggaaaaaga aaaggagttt 1260 caagaggagc tctcccgtca acagcgttct ttccaaacac tggaccagga gtgcatccag 1320 atgaaggcca gactcaccca ggagttacag caagccaaga atatgcacaa cgtcctgcag 1380 gctgaactgg ataaactcac atcagtaaag caacagctag aaaacaattt ggaagagttt 1440 aagcaaaagt tgtgcagagc tgaacaggcg ttccaggcga gtcagatcaa ggagaatgag 1500 ctgaggagaa gcatggagga aatgaagaag gaaaacaacc tccttaagag tcactctgag 1560 caaaaggcca gagaagtctg ccacctggag gcagaactca agaacatcaa acagtgttta 1620 aatcagagcc agaattttgc agaagaaatg aaagcgaaga atacctctca ggaaaccatg 1680 ttaagagatc ttcaagaaaa aataaatcag caagaaaact ccttgacttt agaaaaactg 1740 aagcttgctg tggctgatct ggaaaagcag cgagattgtt ctcaagacct tttgaagaaa 1800 agagaacatc acattgaaca acttaatgat aagttaagca agacagagaa agagtccaaa 1860 gccttgctga gtgctttaga gttaaaaaag aaagaatatg aattgaaaga agagaaaact 1920 ctgttttctt gttggaaaag tgaaaacgaa aaacttttaa ctcagatgga atcagaaaag 1980 gaaaacttgc agagtaaaat taatcacttg gaaacttgtc tgaagacaca gcaaataaaa 2040 agtcatgaat acaacgagag agtaagaacg ctggagatgg acagagaaaa cctaagtgtc 2100 gagatcagaa accttcacaa cgtgttagac agtaagtcag tggaggtaga gacccagaaa 2160 ctagcttata tggagctaca gcagaaagct gagttctcag atcagaaaca tcagaaggaa 2220 atagaaaata tgtgtttgaa gacttctcag cttactgggc aagttgaaga tctagaacac 2280 aagcttcagt tactgtcaaa tgaaataatg gacaaagacc ggtgttacca agacttgcat 2340 gccgaatatg agagcctcag ggatctgcta aaatccaaag atgcttctct ggtgacaaat 2400 gaagatcatc agagaagtct tttggctttt gatcagcagc ctgccatgca tcattccttt 2460 gcaaatataa ttggagaaca aggaagcatg ccttcagaga ggagtgaatg tcgtttagaa 2520 gcagaccaaa gtccgaaaaa ttctgccatc ctacaaaata gagttgattc acttgaattt 2580 tcattagagt ctcaaaaaca gatgaactca gacctgcaaa agcagtgtga agagttggtg 2640 caaatcaaag gagaaataga agaaaatctc atgaaagcag aacagatgca tcaaagtttt 2700 gtggctgaaa caagtcagcg cattagtaag ttacaggaag acacttctgc tcaccagaat 2760 gttgttgctg aaaccttaag tgcccttgag aacaaggaaa aagagctgca acttttaaat 2820 gataaggtag aaactgagca ggcagagatt caagaattaa aaaagagcaa ccatctactt 2880 gaagactctc taaaggagct acaactttta tccgaaaccc taagcttgga gaagaaagaa 2940 atgagttcca tcatttctct aaataaaagg gaaattgaag agctgaccca agagaatggg 3000 actcttaagg aaattaatgc atccttaaat caagagaaga tgaacttaat ccagaaaagt 3060 gagagttttg caaactatat agatgaaagg gagaaaagca tttcagagtt atctgatcag 3120 tacaagcaag aaaaacttat tttactacaa agatgtgaag aaaccggaaa tgcatatgag 3180 gatcttagtc aaaaatacaa agcagcacag gaaaagaatt ctaaattaga atgcttgcta 3240 aatgaatgca ctagtctttg tgaaaatagg aaaaatgagt tggaacagct aaaggaagca 3300 tttgcaaagg aacaccaaga attcttaaca aaattagcat ttgctgaaga aagaaatcag 3360 aatctgatgc tagagttgga gacagtgcag caagctctga gatctgagat gacagataac 3420 caaaacaatt ctaagagcga ggctggtggt ttaaagcaag aaatcatgac tttaaaggaa 3480 gaacaaaaca aaatgcaaaa ggaagttaat gacttattac aagagaatga acagctgatg 3540 aaggtaatga agactaaaca tgaatgtcaa aatctagaat cagaaccaat taggaactct 3600 gtgaaagaaa gagagagtga gagaaatcaa tgtaatttta aacctcagat ggatcttgaa 3660 gttaaagaaa tttctctaga tagttataat gcgcagttgg tgcaattaga agctatgcta 3720 agaaataagg aattaaaact tcaggaaagt gagaaggaga aggagtgcct gcagcatgaa 3780 ttacagacaa ttagaggaga tcttgaaacc agcaatttgc aagacatgca gtcacaagaa 3840 attagtggcc ttaaagactg tgaaatagat gcggaagaaa agtatatttc agggcctcat 3900 gagttgtcaa caagtcaaaa cgacaatgca caccttcagt gctctctgca aacaacaatg 3960 aacaagctga atgagctaga gaaaatatgt gaaatactgc aggctgaaaa gtatgaactc 4020 gtaactgagc tgaatgattc aaggtcagaa tgtatcacag caactaggaa aatggcagaa 4080 gaggtaggga aactactaaa tgaagttaaa atattaaatg atgacagtgg tcttctccat 4140 ggtgagttag tggaagacat accaggaggt gaatttggtg aacaaccaaa tgaacagcac 4200 cctgtgtctt tggctccatt ggacgagagt aattcctacg agcacttgac attgtcagac 4260 aaagaagttc aaatgcactt tgccgaattg caagagaaat tcttatcttt acaaagtgaa 4320 cacaaaattt tacatgatca gcactgtcag atgagctcta aaatgtcaga gctgcagacc 4380 tatgttgact cattaaaggc cgaaaatttg gtcttgtcaa cgaatctgag aaactttcaa 4440 ggtgacttgg tgaaggagat gcagctgggc ttggaggagg ggctcgttcc atccctgtca 4500 tcctcttgtg tgcctgacag ctctagtctt agcagtttgg gagactcctc cttttacaga 4560 gctcttttag aacagacagg agatatgtct cttttgagta atttagaagg ggctgtttca 4620 gcaaaccagt gcagtgtaga tgaagtattt tgcagcagtc tgcaggagga gaatctgacc 4680 aggaaagaaa ccccttcggc cccagcgaag ggtgttgaag agcttgagtc cctctgtgag 4740 gtgtaccggc agtccctcga gaagctagaa gagaaaatgg aaagtcaagg gattatgaaa 4800 aataaggaaa ttcaagagct cgagcagtta ttaagttctg aaaggcaaga gcttgactgc 4860 cttaggaagc agtatttgtc agaaaatgaa cagtggcaac agaagctgac aagcgtgact 4920 ctggagatgg agtccaagtt ggcggcagaa aagaaacaga cggaacaact gtcacttgag 4980 ctggaagtag cacgactcca gctacaaggt ctggacttaa gttctcggtc tttgcttggc 5040 atcgacacag aagatgctat tcaaggccga aatgagagct gtgacatatc aaaagaacat 5100 acttcagaaa ctacagaaag aacaccaaag catgatgttc atcagatttg tgataaagat 5160 gctcagcagg acctcaatct agacattgag aaaataactg agactggtgc attgaaaccc 5220 acaggagagt gctctgggga acagtcccca gataccaatt atgagcctcc aggggaagat 5280 aaaacccagg gctcttcaga atgcatttct gaattgtcat tttctggtcc taatgctttg 5340 gtacctatgg atttcctggg gaatcaggaa gatatccata atcttcaact gcgggtaaaa 5400 gagacatcaa atgagaattt gagattactt catgtgatag aggaccgtga cagaaaagtt 5460 gaaagtttgc taaatgaaat gaaagaatta gactcaaaac tccatttaca ggaggtacaa 5520 ctaatgacca aaattgaagc atgcatagaa ttggaaaaaa tagttgggga acttaagaaa 5580 gaaaactcag atttaagtga aaaattggaa tatttttctt gtgatcacca ggagttactc 5640 cagagagtag aaacttctga aggcctcaat tctgatttag aaatgcatgc agataaatca 5700 tcacgtgaag atattggaga taatgtggcc aaggtgaatg acagctggaa ggagagattt 5760 cttgatgtgg aaaatgagct gagtaggatc agatcggaga aagctagcat tgagcatgaa 5820 gccctctacc tggaggctga cttagaggta gttcaaacag agaagctatg tttagaaaaa 5880 gacaatgaaa ataagcagaa ggttattgtc tgccttgaag aagaactctc agtggtcaca 5940 agtgagagaa accagcttcg tggagaatta gatactatgt caaaaaaaac cacggcactg 6000 gatcagttgt ctgaaaaaat gaaggagaaa acacaagagc ttgagtctca tcaaagtgag 6060 tgtctccatt gcattcaggt ggcagaggca gaggtgaagg aaaagacgga actccttcag 6120 actttgtcct ctgatgtgag tgagctgtta aaagacaaaa ctcatctcca ggaaaagctg 6180 cagagtttgg aaaaggactc acaggcactg tctttgacaa aatgtgagct ggaaaaccaa 6240 attgcacaac tgaataaaga gaaagaattg cttgtcaagg aatctgaaag cctgcaggcc 6300 agactgagtg aatcagatta tgaaaagctg aatgtctcca aggccttgga ggccgcactg 6360 gtggagaaag gtgagttcgc attgaggctg agctcaacac aggaggaagt gcatcagctg 6420 agaagaggca tcgagaaact gagagttcgc attgaggccg atgaaaagaa gcagctgcac 6480 atcgcagaga aactgaaaga acgcgagcgg gagaatgatt cacttaagga taaagttgag 6540 aaccttgaaa gggaattgca gatgtcagaa gaaaaccagg agctagtgat tcttgatgcc 6600 gagaattcca aagcagaagt agagactcta aaaacacaaa tagaagagat ggccagaagc 6660 ctgaaagttt ttgaattaga ccttgtcacg ttaaggtctg aaaaagaaaa tctgacaaaa 6720 caaatacaag aaaaacaagg tcagttgtca gaactagaca agttactctc ttcatttaaa 6780 agtctgttag aagaaaagga gcaagcagag atacagatca aagaagaatc taaaactgca 6840 gtggagatgc ttcagaatca gttaaaggag ctaaatgagg cagtagcagc cttgtgtggt 6900 gaccaagaaa ttatgaaggc cacagaacag agtctagacc caccaataga ggaagagcat 6960 cagctgagaa atagcattga aaagctgaga gcccgcctag aagctgatga aaagaagcag 7020 ctctgtgtct tacaacaact gaaggaaagt gagcatcatg cagatttact taagggtaga 7080 gtggagaacc ttgaaagaga gctagagata gccaggacaa accaagagca tgcagctctt 7140 gaggcagaga attccaaagg agaggtagag accctaaaag caaaaataga agggatgacc 7200 caaagtctga gaggtctgga attagatgtt gttactataa ggtcagaaaa agaaaatctg 7260 acaaatgaat tacaaaaaga gcaagagcga atatctgaat tagaaataat aaattcatca 7320 tttgaaaata ttttgcaaga aaaagagcaa gagaaagtac agatgaaaga aaaatcaagc 7380 actgccatgg agatgcttca aacacaatta aaagagctca atgagagagt ggcagccctg 7440 cataatgacc aagaagcctg taaggccaaa gagcagaatc ttagtagtca agtagagtgt 7500 cttgaacttg agaaggctca gttgctacaa ggccttgatg aggccaaaaa taattatatt 7560 gttttgcaat cttcagtgaa tggcctcatt caagaagtag aagatggcaa gcagaaactg 7620 gagaagaagg atgaagaaat cagtagactg aaaaatcaaa ttcaagacca agagcagctt 7680 gtctctaaac tgtcccaggt ggaaggagag caccaacttt ggaaggagca aaacttagaa 7740 ctgagaaatc tgacagtgga attggagcag aagatccaag tgctacaatc caaaaatgcc 7800 tctttgcagg acacattaga agtgctgcag agttcttaca agaatctaga gaatgagctt 7860 gaattgacaa aaatggacaa aatgtccttt gttgaaaaag taaacaaaat gactgcaaag 7920 gaaactgagc tgcagaggga aatgcatgag atggcacaga aaacagcaga gctgcaagaa 7980 gaactcagtg gagagaaaaa taggctagct ggagagttgc agttactgtt ggaagaaata 8040 aagagcagca aagatcaatt gaaggagctc acactagaaa atagtgaatt gaagaagagc 8100 ctagattgca tgcacaaaga ccaggtggaa aaggaaggga aagtgagaga ggaaatagct 8160 gaatatcagc tacggcttca tgaagctgaa aagaaacacc aggctttgct tttggacaca 8220 aacaaacagt atgaagtaga aatccagaca taccgagaga aattgacttc taaagaagaa 8280 tgtctcagtt cacagaagct ggagatagac cttttaaagt ctagtaaaga agagctcaat 8340 aattcattga aagctactac tcagattttg gaagaattga agaaaaccaa gatggacaat 8400 ctaaaatatg taaatcagtt gaagaaggaa aatgaacgtg cccaggggaa aatgaagttg 8460 ttgatcaaat cctgtaaaca gctggaagag gaaaaggaga tactgcagaa agaactctct 8520 caacttcaag ctgcacagga gaagcagaaa acaggtactg ttatggatac caaggtcgat 8580 gaattaacaa ctgagatcaa agaactgaaa gaaactcttg aagaaaaaac caaggaggca 8640 gatgaatact tggataagta ctgttccttg cttataagcc atgaaaagtt agagaaagct 8700 aaagagatgt tagagacaca agtggcccat ctgtgttcac agcaatctaa acaagattcc 8760 cgagggtctc ctttgctagg tccagttgtt ccaggaccat ctccaatccc ttctgttact 8820 gaaaagaggt tatcatctgg ccaaaataaa gcttcaggca agaggcaaag atccagtgga 8880 atatgggaga atggtggagg accaacacct gctaccccag agagcttttc taaaaaaagc 8940 aagaaagcag tcatgagtgg tattcaccct gcagaagaca cggaaggtac tgagtttgag 9000 ccagagggac ttccagaagt tgtaaagaaa gggtttgctg acatcccgac aggaaagact 9060 agcccatata tcctgcgaag aacaaccatg gcaactcgga ccagcccccg cctggctgca 9120 cagaagttag cgctatcccc actgagtctc ggcaaagaaa atcttgcaga gtcctccaaa 9180 ccaacagctg gtggcagcag atcacaaaag gtcaaagttg ctcagcggag cccagtagat 9240 tcaggcacca tcctccgaga acccaccacg aaatccgtcc cagtcaataa tcttcctgag 9300 agaagtccga ctgacagccc cagagagggc ctgagggtca agcgaggccg acttgtcccc 9360 agccccaaag ctggactgga gtccaagggc agtgagaact gtaaggtcca gtgaaggcac 9420 tttgtgtgtc agtacccctg ggaggtgcca gtcattgaat agataaggct gtgcctacag 9480 gacttctctt tagtcagggc atgctttatt agtgaggaga aaacaattcc ttagaagtct 9540 taaatatatt gtactcttta gatctcccat gtgtaggtat tgaaaaagtt tggaagcact 9600 gatcacctgt tagcattgcc attcctctac tgcaatgtaa atagtataaa gctatgtata 9660 taaagctttt tggtaatatg ttacaattaa aatgacaagc actatatcac aatctctgtt 9720 tgtatgtggg ttttacacta aaaaaatgca aaacacattt tattcttcta attaacagct 9780 cctaggaaaa tgtagacttt tgctttatga tattctatct gtagtatgag gcatggaata 9840 gttttgtatc gggaatttct cagagctgag taaaatgaag gaaaagcatg ttatgtgttt 9900 ttaaggaaaa tgtgcacaca tatacatgta ggagtgttta tctttctctt acaatctgtt 9960 ttagacatct ttgcttatga aacctgtaca tatgtgtgtg tgggtatgtg tttatttcca 10020 gtgagggctg caggcttcct agaggtgtgc tataccatgc gtctgtcgtt gtgctttttt 10080 ctgtttttag accaattttt tacagttctt tggtaagcat tgtcgtatct ggtgatggat 10140 taacatatag cctttgtttt ctaataaaat agtcgccttc gttttctgta 10190 293 6409 DNA Homo sapiens 293 caaggcggcc ggcggcgacc atggcagcgg gccggcggcg gccgtagtgg cccaggcctg 60 ggcttcagcc tcccggggcc ccagagggcg gggcggtccg ggccgcggcg gtggcggcgc 120 cacttccctg ctcccgcccg aggactcctg cgggcactcg ctgaggacca gcggaccggc 180 ggcgcgaatc tgactgaggg gcggggacgc cgtctgttcc ccgccgctcc cggcagggcc 240 gggccgggcc gggctgggcc gggctgggcc gggcgggccc ctgggagcag cccccaggcg 300 ggggaccgcc ttggagaccc gaagccggag ctagaggcag gcggtgggcc cgggtggagt 360 cccggccgga gctggtggtt cgggggcggt gctaggcccc gaggctgcgg gacctgagcg 420 cgaggagcct gagtgcgggt ccagcggtgg cggcatgagc cggcccccgc cgacggggaa 480 aatgcccggc gcccccgaga ccgcgccggg ggacggggca ggcgcgagcc gccagaggaa 540 gctggaggcg ctgatccgag accctcgctc ccccatcaac gtggagagct tgctggatgg 600 cttaaattcc ttggtccttg atttagattt tcctgctttg aggaaaaaca agaacataga 660 taatttctta aatagatatg agaaaattgt gaaaaaaatc aaaggtctac agatgaaggc 720 agaagactat gatgttgtaa aagttattgg aagaggtgct tttggtgaag tgcagttggt 780 tcgtcacaag gcatcgcaga aggtttatgc tatgaagctt cttagtaagt ttgaaatgat 840 aaaaagatca gattctgcct ttttttggga agaaagagat attatggcct ttgccaatag 900 cccctgggtg gttcagcttt tttatgcctt tcaagatgat aggtatctgt acatggtaat 960 ggagtacatg cctggtggag accttgtaaa ccttatgagt aattatgatg tgcctgaaaa 1020 atgggccaaa ttttacactg ctgaagttgt tcttgctctg gatgcaatac actccatggg 1080 tttaatacac agagatgtga agcctgacaa catgctcttg gataaacatg gacatctaaa 1140 attagcagat tttggcacgt gtatgaagat ggatgaaaca ggcatggtac attgtgatac 1200 agcagttgga acaccggatt atatatcacc tgaggttctg aaatcacaag ggggtgatgg 1260 tttctatggg cgagaatgtg attggtggtc tgtaggtgtt ttcctttatg agatgctagt 1320 gggggatact ccattttatg cggattcact tgtaggaaca tatagcaaaa ttatggatca 1380 taagaattca ctgtgtttcc ctgaagatgc agaaatttcc aaacatgcaa agaatctcat 1440 ctgtgctttc ttaacagata gggaggtacg acttgggaga aatggggtgg aagaaatcag 1500 acagcatcct ttctttaaga atgatcagtg gcattgggat aacataagag aaacggcagc 1560 tcctgtagta cctgaactca gcagtgacat agacagcagc aatttcgatg acattgaaga 1620 tgacaaagga gatgtagaaa ccttcccaat tcctaaagct tttgttggaa atcagctgcc 1680 tttcatcgga tttacctact atagagaaaa tttattatta agtgactctc catcttgtag 1740 agaaaatgat tccatacaat caaggaaaaa tgaagaaagt caagagattc agaaaaaact 1800 gtatacatta gaagaacatc ttagcaatga gatgcaagcc aaagaggaac tggaacagaa 1860 gtgcaaatct gttaatactc gcctagaaaa aacagcaaag gagctagaag aggagattac 1920 cttacggaaa agtgtggaat cagcattaag acagttagaa agagaaaagg cgcttcttca 1980 gcacaaaaat gcagaatatc agaggaaagc tgatcatgaa gcagacaaaa aacgaaattt 2040 ggaaaatgat gttaacagct taaaagatca acttgaagat ttgaaaaaaa gaaatcaaaa 2100 ctctcaaata tccactgaga aagtgaatca actccagaga caactggatg aaaccaatgc 2160 tttactgcga acagagtctg atactgcagc ccggttaagg aaaacccagg cagaaagttc 2220 aaaacagatt cagcagctgg aatctaacaa tagagatcta caagataaaa actgcctgct 2280 ggagactgcc aagttaaaac ttgaaaagga atttatcaat cttcagtcag ctctagaatc 2340 tgaaaggagg gatcgaaccc atggatcaga gataattaat gatttacaag gtagaatatg 2400 tggcctagaa gaagatttaa agaacggcaa aatcttacta gcgaaagtag aactggagaa 2460 gagacaactt caggagagat ttactgattt ggaaaaggaa aaaagcaaca tggaaataga 2520 tatgacatac caactaaaag ttatacagca gagcctagaa caagaagaag ctgaacataa 2580 ggccacaaag gcacgactag cagataaaaa taagatctat gagtccatcg aagaagccaa 2640 atcagaagcc atgaaagaaa tggagaagaa gctcttggag gaaagaactt taaaacagaa 2700 agtggagaac ctattgctag aagctgagaa aagatgttct ctattagact gtgacctcaa 2760 acagtcacag cagaaaataa atgagctcct taaacagaaa gatgtgctaa atgaggatgt 2820 tagaaacctg acattaaaaa tagagcaaga aactcagaag cgctgcctta cacaaaatga 2880 cctgaagatg caaacacaac aggttaacac actaaaaatg tcagaaaagc agttaaagca 2940 agaaaataac catctcatgg aaatgaaaat gaacttggaa aaacaaaatg ctgaacttcg 3000 aaaagaacgt caggatgcag atgggcaaat gaaagagctc caggatcagc tcgaagcaga 3060 acagtatttc tcaacccttt ataaaacaca agttagggag cttaaagaag aatgtgaaga 3120 aaagaccaaa cttggtaaag aattgcagca gaagaaacag gaattacagg atgaacggga 3180 ctctttggct gcccaactgg agatcacctt gaccaaagca gattctgagc aactggctcg 3240 ttcaattgct gaagaacaat attctgattt ggaaaaagag aagatcatga aagagctgga 3300 gatcaaagag atgatggcta gacacaaaca ggaacttacg gaaaaagatg ctacaattgc 3360 ttctcttgag gaaactaata ggacactaac tagtgatgtt gccaatcttg caaatgagaa 3420 agaagaatta aataacaaat tgaaagatgt tcaagagcaa ctgtcaagat tgaaagatga 3480 agaaataagc gcagcagcta ttaaagcaca gtttgagaag cagctattaa cagaaagaac 3540 actcaaaact caagctgtga ataagttggc tgagatcatg aatcgaaaag aacctgtcaa 3600 gcgtggtaat gacacagatg tgcggagaaa agagaaggag aatagaaagc tacatatgga 3660 gcttaaatct gaacgtgaga aattgaccca gcagatgatc aagtatcaga aagaactgaa 3720 tgaaatgcag gcacaaatag ctgaagagag ccagattcga attgaactgc agatgacatt 3780 ggacagtaaa gacagtgaca ttgagcagct gcggtcacaa ctccaagcct tgcatattgg 3840 tctggatagt tccagtatag gcagtggacc aggggatgct gaggcagatg atgggtttcc 3900 agaatcaaga ttagaaggat ggctttcatt gcctgtacga aacaacacta agaaatttgg 3960 atgggttaaa aagtatgtga ttgtaagcag taagaagatt cttttctatg acagtgaaca 4020 agataaagaa caatccaatc cttacatggt tttagatata gacaagttat ttcatgtccg 4080 accagttaca cagacagatg tgtatagagc agatgctaaa gaaattccaa ggatattcca 4140 gattctgtat gccaatgaag gagaaagtaa gaaggaacaa gaatttccag tggagccagt 4200 tggagaaaaa tctaattata tttgccacaa gggacatgag tttattccta ctctttatca 4260 tttcccaacc aactgtgagg cttgtatgaa gcccctgtgg cacatgttta agcctcctcc 4320 tgctttggag tgccgccgtt gccatattaa gtgtcataaa gatcatatgg acaaaaagga 4380 ggagattata gcaccttgca aagtatatta tgatatttca acggcaaaga atctgttatt 4440 actagcaaat tctacagaag agcagcagaa gtgggttagt cggttggtga aaaagatacc 4500 taaaaagccc ccagctccag acccttttgc ccgatcatct cctagaactt caatgaagat 4560 acagcaaaac cagtctatta gacggccaag tcgacagctt gccccaaaca aacctagcta 4620 actgccttct atgaaagcag tcattattca aggtgatcgt attcttccag tgaaaacaag 4680 actgaaatat gatgaccaaa atttattaaa aagctatatt ttcctgagag actgatacat 4740 acactcatac atatatgtgt tccccttttc cctgtaatat aaattacaaa tctgggctcc 4800 tttgaagcaa caggttgaac caacaatgat tggttgatag actaaggata tatgcaactc 4860 ttccagactt ttccataaag ctctctcggc agtcgctcac actacagtgc acacaaggat 4920 tgagaagagt taaaggctaa agaaaacatc ttttctagct tcaacagaga ggtttcacca 4980 gcacatttac cagaagaatc tgggaatgga ttccactaca gtgatattga ctgcatcttt 5040 aagaagtgac cattatactg tgtatatata tataaacaca cacacatata tatatatata 5100 tagtactcta atactgcaag aaggtttttt aaacttccca ctttattttt tatacacatt 5160 aatcagatat cattacttgc tgcagttgca actatgcact tgtataaagc cataatgttg 5220 gagtttatat cactcattcc tgtgtacctg atggaagttg catgttcatg tttaagcagt 5280 tactgtaaca agaagtttaa agttaattat atcagtttcc taatgcttca tgataggcaa 5340 ctttacccat tttgaatgcc ttaatttaat ttttttcaaa gtctcagccc tgtctgtatt 5400 aaaaaacaaa aaaagcgttt accagctctt aggatgtaaa ctagctttgt ggaagataaa 5460 tcgtgcacta tttttacaca taaatagtta tatcaatgtc agcctatttt gattaacaaa 5520 tgtttttaaa gtattattgg ttatagaaac aataatggat ggtgttggaa ctaatatatc 5580 cttgatgtct gtctattatt cattcaactc tttttacaga cctcagtatt agtctgtgac 5640 tacaaaatat tttatttgct ttaaatttgc tggctaccct agatgtgttt ttattcctgg 5700 taaagacatt tgtgattaca ttttcacact taagattcaa aatttttccc aaatataaag 5760 aaaactaaga cagactgtag atgcatttta aatatttaaa tatgatcctc agacatgcag 5820 ctgtgtgtgg cagtatttta gtaccgggtt aagaaaactg gcaactggga agaagtggcc 5880 tcaaaggcac ttaatttgat ttttattttt taaatgctgt caaagttaca gtttacgcag 5940 gacattcttg ccgtattctc atgatcccag ataagtgtgt gttttatact gcaacaatat 6000 gcagcaatgg taagcgtaaa gttttttttt gtttttgttt ttgttttttt ttatattatg 6060 aagtctttta acagtctctc tttatataaa tacacagagt ttggtatgat atttaaatac 6120 atcatctggc caggcatggt ggcttacgcc tgtaatccta gcactttggg aggccaagac 6180 gggcggatca cctgaggtga ggagttcaag accagcctgc ccaacatagt gaaactccgt 6240 ctctaccaat atacaaaaat tagccgggca tgatggtggt ggcctgtaat cccagctact 6300 tgggaggctg agacaggaga atcgcttgaa cccaggagac ggtggttgca gtgagcgaag 6360 atcgagccac tgcactccag cctgggcagc tgaacaagac tccgtctcc 6409 294 1383 DNA Homo sapiens 294 ctagtttcta aggatcatgt ctgcgagcca ggattcccga tccagagaca atggccccga 60 tgggatggag cccgaaggcg tcatcgagag taactggaat gagattgttg acagctttga 120 tgacatgaac ctctcggagt cccttctccg tggcatctac gcctatggtt ttgagaagcc 180 ctctgccatc cagcagcgag ccattctacc ttgtatcaag ggttatgatg tgattgctca 240 agcccaatct gggactggga aaacggccac atttgccata tcgattctgc agcagattga 300 attagatcta aaagccaccc aggccttggt cctagcaccc actcgagaat tggctcagca 360 gatacagaag gtggtcatgg cactaggaga ctacatgggc gcctcctgtc acgcctgtat 420 cgggggcacc aacgtgcgtg ctgaggtgca gaaactgcag atggaagctc cccacatcat 480 cgtgggtacc cctggccgtg tgtttgatat gcttaaccgg agatacctgt cccccaaata 540 catcaagatg tttgtactgg atgaagctga cgaaatgtta agccgtggat tcaaggacca 600 gatctatgac atattccaaa agctcaacag caacacccag gtagttttgc tgtcagccac 660 aatgccttct gatgtgcttg aggtgaccaa gaagttcatg agggacccca ttcggattct 720 tgtcaagaag gaagagttga ccctggaggg tatccgccag ttctacatca acgtggaacg 780 agaggagtgg aagctggaca cactatgtga cttgtatgaa accctgacca tcacccaggc 840 agtcatcttc atcaacaccc ggaggaaggt ggactggctc accgagaaga tgcatgctcg 900 agatttcact gtatccgcca tgcatggaga tatggaccaa aaggaacgag acgtgattat 960 gagggagttt cgttctggct ctagcagagt tttgattacc actgacctgc tggccagagg 1020 cattgatgtg cagcaggttt ctttagtcat caactatgac cttcccacca acagggaaaa 1080 ctatatccac agaatcggtc gaggtggacg gtttggccgt aaaggtgtgg ctattaacat 1140 ggtgacagaa gaagacaaga ggactcttcg agacattgag accttctaca acacctccat 1200 tgaggaaatg cccctcaatg ttgctgacct catctgaggg gctgtcctgc cacccagccc 1260 cagccagggc tcaatctctg ggggctgagg agcagcagga ggggggaggg aagggagcca 1320 agggatggac atcttgtcat tttttttctt tgaataaatg tcactttttg aggcaaaaga 1380 agg 1383 295 188 DNA Homo sapiens misc_feature (1)...(188) n = A,T,C or G 295 cgcggscggg cggtgcgcac tgcgggcnnt ccctgccccg gcgccgtccg tgcccgcggg 60 acctgacagc cgggtcagag ggcgangtgn tgctcaggcc cgggctggac gcagagccag 120 agctgtcccc agaggagcag agggtcctgg aaaggaagct gaaaaaggaa cggaagaaag 180 aggagagg 188 296 3161 DNA Homo sapiens 296 gccccctctt ctcccctccc ccaagccagc acctggtcgc ccggcgggtc gtgcggcgcg 60 gcgctccgcg agagctggcg gtcaccatgg caatgcggga gctggtggag gccgaatgcg 120 ggggtgccaa cccgctcatg aagctcgccg ggcacttcac ccaggacaag gcccttcggc 180 aggagggatt gaggcctggc ccctggcccc ccggagcccc ggcctctgag gcagcctcca 240 agcctttggg agtagcttct gaagatgagt tggtggctga attcctgcag gaccagaatg 300 caccccttgt gtcccgtgcc cctcagacct tcaagatgga tgacctcctg gctgagatgc 360 agcagattga gcagtcaaac ttccgccagg ctccccagag agcccctggt gtggcagact 420 tggccttgtc tgagaactgg gcccaggagt ttcttgcagc tggagatgct gtggatgtaa 480 ctcaggatta taatgagact gactggtccc aagaattcat ctctgaagtt acagacccct 540 tgtctgtgtc ccctgcccgc tgggctgagg aatatttgga gcaatcagag gagaagctgt 600 ggctgggaga acctgaggga acagccaccg atcgctggta tgatgaatat catcctgagg 660 aggatctgca gcacacggcc agtgactttg tggccaaagt ggatgacccc aaattggcta 720 attctgaggg tacatcagat gcctgggttg accagttcac aagaccagta aacacatctg 780 cccttgatat ggagtttgaa cgagccaagt cagctataga gtctgatgtc gatttctggg 840 acaagttgca ggcagagttg gaggagatgg caaaacggga tgctgaggcc cacccctggc 900 tttctgacta tgatgacctt acgtcagcta cctatgataa ggggtaccag tttgaggagg 960 agaacccctt gcgtgatcac cctcagcctt ttgaagaagg gctgcggcgc cttcaggagg 1020 gggacctgcc aaatgctgtg ctgctttttg aggcagctgt gcagcaggat cctaagcaca 1080 tggaagcttg gcagtatctg ggtaccaccc aggcagagaa tgaacaagaa ctattagcca 1140 tcagtgcatt gcggaggtgt ctggagctaa agccagataa ccagacagca ctgatggcgc 1200 tggctgtgag cttcaccaac gagtccctgc agcgacaggc ctgtgaaacc ctacgagact 1260 ggctgcggta cacaccagcc tatgcccatc tggtgacacc tgctgaagaa ggggctggtg 1320 gggcaggact gggccccagc aagcgtatcc tgggatctct cttgtctgac tccctgtttc 1380 ttgaagtgaa agagctcttc ctggcagctg tgcggctgga ccctacctcc attgaccctg 1440 atgtgcagtg tggcttggga gtccttttca acctgagtgg ggagtatgac aaggccgtgg 1500 actgcttcac agctgccctc agcgttcgtc ccaatgacta tttgctgtgg aataagctag 1560 gcgccaccct ggccaatgga aaccagagtg aagaagcagt agctgcgtac cgccgggccc 1620 tcgagctcca gcctggctat atccggtccc gctataacct gggcatcagc tgcatcaacc 1680 tcggggctca ccgggaggct gtggagcact ttctggaggc cctgaacatg cagaggaaaa 1740 gccggggccc ccggggtgaa ggaggtgcca tgtcggagaa catctggagc accctgcgtt 1800 tggcattgtc tatgttaggc cagagcgatg cctatggggc agccgacgcg cgggatctgt 1860 ccaccctcct aactatgttt ggcctgcccc agtgacagtg ggacgggctg ccctgtgagt 1920 gtccacctgg agggatcccc gctttggatg tgattccctc tccccaaatg ggcctaccaa 1980 gggggcgggc tgatgaccat aagcggtacg gcctttcagg agctgcctca acgtaggggt 2040 gggtagtctg tgttctagtt cctacataat tgtaggaaaa tgagctgtgt catctctgag 2100 tcccttggta attcaagggc tgtacatcca gctacagatc tctctgctca tcatgccctt 2160 tcttggtgct gctttttggg taggacccca cgatttaggg taactgttat catcagctgc 2220 catttctgat agggtctacc acatctgtaa tgtctgtcct ttcccccact tttactggga 2280 attgatagtc cagcttcctt gggcagtgta agtaggaggt tcatctgctg tgcgcctcta 2340 atgtctgtct ggatgggatg tgttaggagt tggcctgttg ggttgaattg ttgatttggc 2400 tgagcagagc tgagttttgg taggagtgct catggttctg tcattcttgg acctctcctg 2460 gctgagctct gattccctgt gagcacgatg ctgatgcaat agtcctgtgt catcactgca 2520 gcggtcctca ggagctgcca gggccaattg ctacagagtg tctgggtgtg tggcatagga 2580 ggaaggtttg cttgtgaaat gaggctgggt gggagcgggg agggactaga tcagaagaga 2640 tcaagggctg tattcaggaa cgttggtggg aggacagagc aagtgggaag ggggtatggt 2700 gagtgcggca atccctcatc ctcttagaag cacctgtgaa tgggaattga gccaactgtt 2760 atagaaaatt ggttcagaaa gtgcaatctt gccagatttc tagcaaatag gttcagtgtt 2820 accataagcc tttgctgtac ttcttgaaat gtttctaggg gagagcattg gaaaatcccc 2880 ttcccccatc tagatcgaag gaagatgagg gagcagcttg gattcttctc agttgtcccc 2940 tgcatgggga gatacactaa cccccagaaa tgactgctaa gcctcttgcc ttgtctttag 3000 tagctaatga tcagagagat tttttttttt aaactaccat ggtcccagga ttccatcctg 3060 aaatttattt ttctttgtat gaatatgtgt aaatgattta aaaataaaac tgtaaaatat 3120 ttgtacgaag aataaatgga actgatgtgg gaaaaaaaaa a 3161 297 7923 DNA Homo sapiens 297 gcgcactcgg gcacgcgctc ggaagtcggg ggtcggcgcg gagtgcaggc tgctcccggg 60 gtaggtgagg gaagcgcgga ggcggggcgc gggggcagtg gtcggcgagc agcgcggtcc 120 tcgctagggg cgcccacccg tcagtctctc cggcgcgagc cgccgccacc gcccgcgccg 180 gagtcaggcc cctgggcccc caggctcaag cagcgaagcg gcctccgggg gacgccgcta 240 ggcgagagga acgcgccggt gcccttgcct tcgccgtgac ccagcgtgcg ggcggcggga 300 tgagagggag ccatcgggcc gcgccggccc tgcggccccg ggggcggctc tggcccgtgc 360 tggccgtgct ggcggcggcc gccgcggcgg gctgtgccca ggcagccatg gacgagtgca 420 cggacgaggg cgggcggccg cagcgctgca tgcccgagtt cgtcaacgcc gctttcaacg 480 tgactgtggt ggccaccaac acgtgtggga ctccgcccga ggaatactgt gtgcagaccg 540 gggtgaccgg ggtcaccaag tcctgtcacc tgtgcgacgc cgggcagccc cacctgcagc 600 acggggcagc cttcctgacc gactacaaca accaggccga caccacctgg tggcaaagcc 660 agaccatgct ggccggggtg cagtacccca gctccatcaa cctcacgctg cacctgggaa 720 aagcttttga catcacctat gtgcgtctca agttccacac cagccgcccg gagagctttg 780 ccatttacaa gcgcacacgg gaagacgggc cctggattcc ttaccagtac tacagtggtt 840 cctgcgagaa cacctactcc aaggcaaacc gcggcttcat caggacagga ggggacgagc 900 agcaggcctt gtgtactgat gaattcagtg acatttctcc cctcactggg ggcaacgtgg 960 ccttttctac cctggaagga aggcccagcg cctataactt tgacaatagc cctgtgctgc 1020 aggaatgggt aactgccact gacatcagag taactcttaa tcgcctgaac acttttggag 1080 atgaagtgtt taacgatccc aaagttctca agtcctatta ttatgccatc tctgattttg 1140 ctgtaggtgg cagatgtaaa tgtaatggac acgcaagcga gtgtatgaag aacgaatttg 1200 ataagctggt gtgtaattgc aaacataaca catatggagt agactgtgaa aagtgtcttc 1260 ctttcttcaa tgaccggccg tggaggaggg caactgcgga aagtgccagt gaatgcctgc 1320 cctgtgattg caatggtcga tcccaggaat gctacttcga ccctgaactc tatcgttcca 1380 ctggccatgg gggccactgt accaactgcc aggataacac agatggcgcc cactgtgaga 1440 ggtgccgaga gaacttcttc cgccttggca acaatgaagc ctgctcttca tgccactgta 1500 gtcctgtggg ctctctaagc acacagtgtg atagttacgg cagatgcagc tgtaagccag 1560 gagtgatggg ggacaaatgt gaccgttgcc agcctggatt ccattctctc actgaagcag 1620 gatgcaggcc atgctcttgt gatccctctg gcagcataga tgaatgtaat gttgaaacag 1680 gaagatgtgt ttgcaaagac aatgtcgaag gcttcaattg tgaaagatgc aaacctggat 1740 tttttaatct ggaatcatct aatcctcggg gttgcacacc ctgcttctgc tttgggcatt 1800 cttctgtctg tacaaacgct gttggctaca gtgtttattc tatctcctct acctttcaga 1860 ttgatgagga tgggtggcgt gcggaacaga gagatggctc tgaagcatct ctcgagtggt 1920 cctctgagag gcaagatatc gccgtgatct cagacagcta ctttcctcgg tacttcattg 1980 ctcctgcaaa gttcttgggc aagcaggtgt tgagttatgg tcagaacctc tccttctcct 2040 ttcgagtgga caggcgagat actcgcctct ctgccgaaga ccttgtgctt gagggagctg 2100 gcttaagagt atctgtaccc ttgatcgctc agggcaattc ctatccaagt gagaccactg 2160 tgaagtatgt cttcaggctc catgaagcaa cagattaccc ttggaggcct gctcttaccc 2220 cttttgaatt tcagaagctc ctaaacaact tgacctctat caagatacgt gggacataca 2280 gtgagagaag tgctggatat ttggatgatg tcaccctggc aagtgctcgt cctgggcctg 2340 gagtccctgc aacttgggtg gagtcctgca cctgtcctgt gggatatgga gggcagtttt 2400 gtgagatgtg cctctcaggt tacagaagag aaactcctaa tcttggacca tacagtccat 2460 gtgtgctttg cgcctgcaat ggacacagcg agacctgtga tcctgagaca ggtgtttgta 2520 actgcagaga caatacggct ggcccgcact gtgagaagtg cagtgatggg tactatggag 2580 attcaactgc aggcacctcc tccgattgcc aaccctgtcc gtgtcctgga ggttcaagtt 2640 gtgctgttgt tcccaagaca aaggaggtgg tgtgcaccaa ctgtcctact ggcaccactg 2700 gtaagagatg tgagctctgt gatgatggct actttggaga ccccctgggt agaaacggcc 2760 ctgtgagact ttgccgcctg tgccagtgca gtgacaacat cgatcccaac gcagttggaa 2820 attgcaatcg cttgacggga gaatgcctga agtgcatcta taacactgct ggcttctatt 2880 gtgaccggtg caaagacgga ttttttggaa atcccctggc tcccaatcca gcagacaaat 2940 gcaaagcctg caattgcaat ccgtatggga ccatgaagca gcagagcagc tgtaaccccg 3000 tgacggggca gtgtgaatgt ttgcctcacg tgactggcca ggactgtggt gcttgtgacc 3060 ctggattcta caatctgcag agtgggcaag gctgtgagag gtgtgactgc catgccttgg 3120 gctccaccaa tgggcagtgt gacatccgca ccggccagtg tgagtgccag cccggcatca 3180 ctggtcagca ctgtgagcgc tgtgaggtca accactttgg gtttggacct gaaggctgca 3240 aaccctgtga ctgtcatcct gagggatctc tttcacttca gtgcaaagat gatggtcgct 3300 gtgaatgcag agaaggcttt gtgggaaatc gctgtgacca gtgtgaagaa aactatttct 3360 acaatcggtc ttggcctggc tgccaggaat gtccagcttg ttaccggctg gtaaaggata 3420 aggttgctga tcatagagtg aagctccagg aattagagag tctcatagca aaccttggaa 3480 ctggggatga gatggtgaca gatcaagcct tcgaggatag actaaaggaa gcagagaggg 3540 aagttatgga cctccttcgt gaggcccagg atgtcaaaga tgttgaccag aatttgatgg 3600 atcgcctaca gagagtgaat aacactctgt ccagccaaat tagccgttta cagaatatcc 3660 ggaataccat tgaagagact ggaaacttgg ctgaacaagc gcgtgcccat gtagagaaca 3720 cagagcggtt gattgaaatc gcatccagag aacttgagaa agcaaaagtc gctgctgcca 3780 atgtgtcagt cactcagcca gaatctacag gggacccaaa caacatgact cttttggcag 3840 aagaggctcg aaagcttgct gaacgtcata aacaggaagc tgatgacatt gttcgagtgg 3900 caaagacagc caatgatacg tcaactgagg catacaacct gcttctgagg acactggcag 3960 gagaaaatca aacagcattt gagattgaag agcttaatag gaagtatgaa caagcgaaga 4020 acatctcaca ggatctggaa aaacaagctg cccgagtaca tgaggaggcc aaaagggccg 4080 gtgacaaagc tgtggagatc tatgccagcg tggctcagct gagccctttg gactctgaga 4140 cactggagaa tgaagcaaat aacataaaga tggaagctga gaatctggaa caactgattg 4200 accagaaatt aaaagattat gaggacctca gagaagatat gagagggaag gaacttgaag 4260 tcaagaacct tctggagaaa ggcaagactg aacagcagac cgcagaccaa ctcctagccc 4320 gagctgatgc tgccaaggcc ctcgctgaag aagctgcaaa gaagggacgg gataccttac 4380 aagaagctaa tgacattctc aacaacctga aagattttga taggcgcgtg aacgataaca 4440 agacggccgc agaggaggca ctaaggaaga ttcctgccat caaccagacc atcactgaag 4500 ccaatgaaaa gaccagagaa gcccagcagg ccctgggcag tgctgcggcg gatgccacag 4560 aggccaagaa caaggcccat gaggcggaga ggatcgcaag cgctgtccaa aagaatgcca 4620 ccagcaccaa ggcagaagct gaaagaactt ttgcagaagt tacagatctg gataatgagg 4680 tgaacaatat gttgaagcaa ctgcaggaag cagaaaaaga gctaaagaga aaacaagatg 4740 acgctgacca ggacatgatg atggcaggga tggcttcaca ggctgctcaa gaagccgaga 4800 tcaatgccag aaaagccaaa aactctgtta ctagcctcct cagcattatt aatgacctct 4860 tggagcagct ggggcagctg gatacagtgg acctgaataa gctaaacgag attgaaggca 4920 ccctaaacaa agccaaagat gaaatgaagg tcagcgatct tgataggaaa gtgtctgacc 4980 tggagaatga agccaagaag caggaggctg ccatcatgga ctataaccga gatatcgagg 5040 agatcatgaa ggacattcgc aatctggagg acatcaggaa gaccttacca tctggctgct 5100 tcaacacccc gtccattgaa aagccctagt gtctttaggg ctggaaggca gcatccctct 5160 gacagggggg cagttgtgag gccacagagt gccttgacac aaagattaca tttttcagac 5220 ccccactcct ctgctgctgt ccatcactgt ccttttgaac caggaaaagt cacagagttt 5280 aaagagaagc aaattaaaca tcctgaatcg ggaacaaagg gttttatcta ataaagtgtc 5340 tcttccatca cgttgctacc ttacccacac ttccctctga tttgcgtgag gacgtggcat 5400 cctacttacg tacgtggcat aacacatcgt gtgagcccat gtatgctggg gtagagcaag 5460 tagccctccc ctgtctcatc gatccagcag aacctcctca gtctcagtac tcttgtttct 5520 ataaggaaaa gttttgctac taacagtagc attgtgatgg ccagtatatc cagtccatgg 5580 ataaagaaaa tgcatctgca tctcctgccc ctcttccttc taagcaaaag gaaataaaca 5640 tcctgtgcca aaggtattgg tcatttagaa tgtcggtagc catccatcag tgcttttagc 5700 tattatgagt gtaggacact gagccatccg tgggtcagga tgcaattatt tataaaagtc 5760 cccaggtgaa catggctgaa gatttttcta gtatattaat aattgactag gaagatgaac 5820 tttttttcag atctttgggc agctgataat ttaaatctgg atgggcagct tgcactcacc 5880 aatagaccaa aagacatctt ttgatattct tataaatgga acttacacag aagaaatagg 5940 gatatgataa ccactaaagt tttgttttca aaatcaaact aattcttaca gcttttttat 6000 tagttagtct tggaactagt gttaagtatc tggcagagaa cagttaatcc ctaaggtctt 6060 gacaaaacag aagaaaaaca agcctcctcg tcctagtctt ttctagcaaa gggataaaac 6120 ttagatggca gcttgtactg tcagaatccc gtgtatccat ttgttcttct gttggagaga 6180 tgagacattt gacccttagc tccagttttc ttctgatgtt tccatcttcc agaatccctc 6240 aaaaaacatt gtttgccaaa tcctggtggc aaatacttgc actcagtatt tcacacagct 6300 gccaacgcta tcgagttcct gcactttgtg atttaaatcc actctaaacc ttccctctaa 6360 gtgtagaggg aagaccctta cgtggagttt cctagtgggc ttctcaactt ttgatcctca 6420 gctctgtggt tttaagacca cagtgtgaca gttccctgcc acacaccccc ttcctcctac 6480 caacccacct ttgagattca tatatagcct ttaacactat gcaactttgt actttgcgta 6540 gcaggggctg gggtgggggg aaagaaacct attatcatgg acacactggt gctattaatt 6600 atttcaaatt tatatttttg tgtgaatgtt ttgtgttttg tttatccatg ctatagaaca 6660 aggaatttat gtagatatac ttagtcctat ttctagaatg acactctgtt cactttgctc 6720 aatttttcct cttcactggc acaagtatct gaatacctcc ttccctccct tctagagttc 6780 tttggattgt actccaaaga attgtgcctt gtgtttgcag catctccatt ctctaaatta 6840 atataattgc tttcctccac acccagccac gtaaagaggt aacttgggtc ctcttccatt 6900 gcagtcctga tgatcctaac ctgcagcacg gtggttttac aatgttccag agcaggaacg 6960 ccaggttgac aagctatggt aggattagga aagtttgctg aagaggatct ttgacgccac 7020 agtgggacta gccaggaatg agggagaaat gccctttttg gcaattgttg gagctggata 7080 ggtaagtttt ataagggagt acattttgac tgagcactta gggcatcagg aacagtgcta 7140 cttactggtg ggtagactgg gagaggtggt gtaacttagt tcttgatgat cccacttcct 7200 gtttccatct gcttgggata taccagagtt taccacaagt gttttgacga tatactcctg 7260 agctttcact ctgctggctt ctcccaggcc tcttctacta tggcaggaga tgtggtgtgc 7320 tgttgcaaag ttttcacgtc atcgtttcct ggctagttca tttcattaag tggctacatc 7380 ctaacatatg cattggtcaa ggttgcagca agaggactga agattgactg ccaagctagt 7440 ttgggtgaag ttcactccag caagtctcag gccacaatgg ggtggtttgg tttggtttcc 7500 ttttaacttt ctttttgtta tttgcttttc tcctccacct gtgtggtata ttttttaagc 7560 agaattttat tttttaaaat aaaaggttct ttacaagatg ataccttaat tacactcccg 7620 caacacagcc attattttat tgtctagctc cagttatctg tattttatgt aatgtaattg 7680 acaggatggc tgctgcagaa tgctggttga cacagggatt attatactgc tatttttccc 7740 tgaattcttt tccttggaat tccaactgtg gaccttttat atgtgccttc actttagctg 7800 tttgccttac tctacagcct tgctctccgg ggtggttaat aaaatgcaac acttggcatt 7860 tttatgttat aagaaaaaca gtattttatt tataataaaa tctgaatatt ttgtaaccct 7920 tta 7923 298 3282 DNA Homo sapiens 298 atgagttggt tcaacgcctc ccagctctcc agcttcgcta agcaggccct gtcccaggcc 60 cagaagtcta ttgacagggt tctggacatc caggaagagg agccgagcat ctgggccgag 120 accattccgt atggagagcc gggaataagt tcccctgtca gtggaggatg ggatacttca 180 acctgggggt tgaaatcaaa cactgaacct cagagtccac caatagcctc tcctaaagca 240 atcacaaagc cagttcggag gactgtggtc gatgaatctg aaaatttctt cagtgccttt 300 ctctcgccaa ccgatgtcca gaccattcag aagagtccag tggtatcaaa acctccagca 360 aaatcacaac gaccagaaga agaagtgaaa agcagcttac atgaatcctt gcacattggc 420 cagtcaagaa ctcctgaaac aactgaatca caagtaaaag actcttcttt gtgtgtttca 480 ggggaaactc tggcagcagg tacttcatca cctaaaactg aaggcaagca cgaagaaact 540 gttaataaag aatcggatat gaaggtgcca actgtaagtt tgaaagtatc tgaaagtgta 600 attgatgtga aaacaactat ggaaagtata tctaatacgt ctacgcagtc tctcacagca 660 gaaacaaagg acatagcttt ggaacctaag gaacaaaaac atgaagacag gcagagcaat 720 acaccttctc ctcctgttag taccttttca tcaggtactt ctaccaccag tgatattgaa 780 gttttagatc atgaaagtgt aataagtgag agctcagcga gctcgagaca agagactaca 840 gattcaaaat caagtcttca cttgatgcag acatcttttc agcttctctc tgcatctgct 900 tgtcctgaat ataatcgttt agatgatttc caaaaactca ctgagagttg ctgttcatct 960 gatgcttttg aaagaataga ctcatttagt gtacagtcat tagatagccg gagtgtaagt 1020 gaaatcaatt cagatgatga attgtcaggc aagggatatg ctttagtgcc tattatagtt 1080 aattcttcaa ctccaaagtc taaaacagtt gaatctgctg aaggaaaatc tgaagaagta 1140 aatgaaacat tagttatacc cactgaggaa gcagaaatgg aagaaagtgg acgaagtgca 1200 actcctgtta actgtgaaca gcctgatatc ttggtttctt ctacaccaat aaatgaagga 1260 cagactgtgt tagacaaggt ggctgagcag tgtgaacctg ctgaaagtca gccagaagca 1320 ctttctgaga aggaagatgt ttgcaagaca gttgaatttc tgaatgaaaa gctggaaaaa 1380 agggaggctc agttattatc tcttagtaag gaaaaagcac ttctagaaga agcttttgat 1440 aacctgaaag atgaaatgtt cagagtgaaa gaagaaagca gtagcatttc ttccttgaaa 1500 gatgagttta ctcaaagaat tgcagaagca gaaaagaaag ttcaactagc ctgcaaagag 1560 agagatgctg ctaaaaagga aatcaaaaac ataaaagaag aacttgccac tagattaaat 1620 agtagtgaaa ctgcagacct tttgaaagag aaagatgagc agatccgagg gttaatggaa 1680 gaaggagaaa aactttcaaa acagcagctg cacaattcta acatcatcaa gaaattaaga 1740 gctaaagaca aggagaatga aaatatggtt gcaaagctga acaaaaaagt taaagagcta 1800 gaagaggagt tgcagcattt gaaacaggtc cttgatggca aagaagaggt tgagaaacaa 1860 catagagaaa atattaaaaa actaaattcc atggtagaac gccaagagaa agatcttggc 1920 cgtcttcagg tagacatgga tgaacttgaa gaaaagaacc gaagtattca ggctgccctg 1980 gatagtgcat acaaagaact tactgatctt cacaaagcca atgctgcaaa ggatagtgag 2040 gcacaggaag ctgctctgag ccgtgaaatg aaagctaaag aagaactttc tgcagcatta 2100 gagaaggccc aagaagaagc ccgtcagcag caagaaacat tagccattca agtgggggac 2160 cttaggcttg cattgcagcg tacagaacaa gcggctgcca gaaaggagga ttatttacgc 2220 catgagatcg gtgaacttca gcagagactc caggaagcag agaatcgaaa ccaggaactg 2280 agtcaaagtg tttcatcaac aacaagacca ttgcttcgac aaatagaaaa tttgcaagca 2340 accctgggat cccagacatc gtcgtgggag aaattagaga agaatctttc tgataggctt 2400 ggtgaatccc agaccttgct ggcagcagca gttgagagag aacgtgcagc tacagaagaa 2460 ctccttgcta acaaaattca gatgtcttcc atggagtcac agaattctct tttaagacag 2520 gaaaacagta gatttcaagc ccagctagaa tcagagaaaa ataggctgtg taaactggag 2580 gatgagaaca ataggtacca ggttgaattg gaaaacctaa aagatgaata tgtaagaaca 2640 cttgaagaga cgaggaaaga aaagacattg ttgaatagtc agttagaaat ggaaagaatg 2700 aaagttgaac aagaaaggaa gaaagccatt tttactcaag aaacaataaa agaaaaggaa 2760 cgcaagccat tttctgtttc tagcactccc accatgtcac gctcaagttc aataagtggt 2820 gttgatatgg caggactaca gacatctttt ctgtctcagg atgagtctca tgatcactca 2880 tttggaccaa tgcctatatc agccaaatgg aagcatcttt atgctgcctg taaggatggg 2940 agcaggatca agcatattga aaacctacag tctcagctaa agctaaggga aggggaaatc 3000 actcatttac agctagaaat tggcaatcta gaaaaaactc gatcaataat ggctgaagaa 3060 ctagttaaat taacaaatca aaatgatgaa cttgaagaga aggtgaagga gatacccaaa 3120 cttagaactc agctaagaga tttggatcaa aggtacaaca ctattctgca gatgtatgga 3180 gaaaaagcag aagaggcaga agaacttcga ttagatctcg aagatgtaaa aaatatgtac 3240 aaaactcaaa tagatgaact tttaagacaa agtctcagtt aa 3282 299 5900 DNA Homo sapiens 299 gctgtcgctg tgtttgcttt aacctgagtc ttgttcctta ttgtggttcc tgctgtggtt 60 ttgatcatgt tgttaccctc ggacgtagcc cggcttgtat tgggttactt acagcaagaa 120 aacctcattt ctacctgcca gacttttatt ttggaaagtt cagatttaaa agaatatgca 180 gaacattgta cagatgaagg gtttattcca gcctgcttac tgtccttatt tggaaaaaac 240 ttgacaacaa ttttaaatga gtatgtagct atgaaaacaa aagaaacatc aaataatgtc 300 ccagcaataa tgtcatctct atggaagaaa ttggaccata cactttctca gatcaggagc 360 atgcaaagtt ccccaaggtt tgctggcagt cagagagccc gaacgagaac tggaattgca 420 gaaatcaaac ggcagagaaa gcttgcatct caaacagctc cagccagtgc agagttgctc 480 actttacctt acctttcagg acagtttacc actcctcctt ccacaggtac acaggttact 540 cgaccaagtg gccaaatttc agatccatcg aggtcatatt ttgtagtggt caaccactca 600 cagtcacaag atactgtaac cactggagaa gctttaaatg tcattcctgg tgctcaggaa 660 aagaaagcac atgccagttt aatgtctccc ggtagacgca aaagtgaatc tcagagaaaa 720 agtaccactt tgtctggccc tcattcaaca atacggaatt tccaagatcc aaacgctttt 780 gcagtagaaa aacaaatggt tattgaaaat gcacgagaaa aaatactaag caacaaatct 840 cttcaagaaa agctagcaga aaacataaat aaatttttaa ctagtgataa caatattgcc 900 caagtaccta agcaaacaga taacaaccct acggagccag agacttcaat tgatgaattc 960 ctaggacttc cgagtgaaat tcacatgtct gaagaagcta tacaggacat attggaacag 1020 acagaatcag acccagcatt tcaggcactc tttgatctct ttgactatgg caaaacaaag 1080 aataataaaa atatatcaca aagtatttcc agtcaaccta tggaatccaa tcccagtata 1140 gtcttagcag atgaaactaa tctagcagtt aaaggttctt ttgaaacaga agaatctgat 1200 ggtcagtctg gtcagcccgc tttttgtaca tcctatcaga atgatgaccc attaaatgct 1260 ttgaagaata gcaacaacca tgatgtgctt agacaagaag accaggaaaa tttttcccaa 1320 ataagtacca gcatacagaa aaaggccttt aaaacagctg tacccactga acagaagtgt 1380 gacattgaca ttacctttga gtccgtgcct aatttgaatg actttaacca aagagggaat 1440 tctaatgctg aatgtaatcc acattgtgct gaattataca ccaatcagat gtccactgaa 1500 actgaaatgg ctatagggat tgaaaagaac tctttgtctt caaatgtacc gagtgaatct 1560 cagttacagc ctgatcagcc tgatatacca ataacttcat ttgtttcact tggttgtgaa 1620 gctaacaatg aaaacttaat tctctctggg aagagttctc aacttttatc ccaagatact 1680 tcattaactg gaaagccatc taaaaaaagt caattttgtg aaaattctaa tgatacagta 1740 aaacttaaaa ttaattttca tggttccaag tcatcagatt ctagtgaaat tcacaagagt 1800 aaaatagaaa ttaatgtgtt agaaccagtt atgtcacagc tatcaaattg ccaagataat 1860 tcttgtcttc aaagtgaaat actacctgtg tctgttgaaa gttcacattt aaatgtatct 1920 ggacaagtag aaattcatct tggagattcg ctgtcttcta ctaaacaacc atctaatgat 1980 tcagcatctg ttgagttaaa tcatacagaa aatgaagctc aggcatccaa gtctgagaat 2040 tcacaggagc cttcatcttc tgtaaaagaa gagaatacta tttttctctc tttaggtgga 2100 aatgctaact gtgagaaagt tgcactgacg cctccagaag gcactcctgt agaaaacagt 2160 cactctcttc ctccagaatc tgtgtgttct tcagtgggag attctcaccc tgagtcccaa 2220 aatactgatg ataaaccttc tagcaacaac tcagcagaga tagatgcatc aaatatcgtc 2280 tctctcaaag ttatcattag tgatgatcca tttgtttcct cagatactga acttaccagt 2340 gctgtttcta gtattaatgg agaaaacctg ccaactataa tcttgtcttc tcctactaaa 2400 tcacctacta aaaatgcaga actagttaaa tgcctatctt cagaagaaac tgtaggtgct 2460 gttgtatatg ccgaagtagg ggattcagcc tcaatggaac agagtctttt aacattcaaa 2520 tctgaagact ctgcagtaaa caatactcag aatgaagatg gcattgcttt ttcagctaat 2580 gttacaccat gtgtttccaa ggatggagga tatatacagt tgatgccagc cacaagcaca 2640 gcttttggca attcaaataa cattctgata gctacctgtg tgactgatcc aacagcgtta 2700 ggaacatctg taagtcagtc taatgtagtg gtgttgcctg gaaattctgc acctatgact 2760 gctcaacctc taccacctca gttacagaca ccaccaaggt caaacagtgt atttgctgtc 2820 aaccaagctg tgtcaccaaa cttttcacaa ggatctgcca taataattgc ctctccagtc 2880 cagcctgtac tccaaggaat ggtagggatg atcccagtat ctgtggttgg acagaatgga 2940 aataactttt ctactcctcc tcggcaggtt cttcatatgc ctttgacagc acctgtatgc 3000 aatagaagta tccctcaatt ccccgtccct ccaaaatctc agaaggctca gggactaaga 3060 aacaagcctt gtataggaaa acaagtaaat aatttggtgg attcgtcagg tcattcagtt 3120 ggatgtcatg cacaaaaaac tgaagtttct gacaaaagta ttgccacaga tcttgggaaa 3180 aaatcagaag aaaccacagt tcccttccca gaagagagta tagttccagc tgctaaacca 3240 tgccacagac gtgtactctg tttcgacagc actactgctc ctgtggcaaa tacgcagggg 3300 ccaaaccata agatggtgtc ccaaaacaaa gaaaggaatg cagtctcttt tcctaatctt 3360 gactcaccca atgtgtcctc caccttaaaa cccccttcta ataatgctat caaaagagag 3420 aaagagaagc ctcctctgcc taagatttta tctaaatcgg aaagtgccat tagccggcat 3480 accaccataa gagaaactca atcagaaaag aaagtttcac caacagaaat tgtgcttgaa 3540 tctttccata aagcaacagc taataaggag aatgaattat gcagcgatgt agaaagacag 3600 aaaaatccag aaaattcaaa actatctatt gggcagcaaa atgggggttt gcgaagtgag 3660 aaatctatag cttcactgca agaaatgacc aaaaaacaag gcacatcttc aaacaataaa 3720 aatgtacttt cagtaggtac agctgtgaag gatctaaaac aagaacaaac taaatccgcc 3780 agttctttga ttaccacaga aatgttacag gatatacaga ggcacagctc agtaagtagg 3840 cttgctgata gtagtgattt acctgtgccc cggacacctg gctcaggggc aggggaaaaa 3900 cataaagaag aacctataga tattatcaag gccccctcta gtaggcgttt cagtgaagac 3960 agtagtacat caaaagtaat ggtccctcct gtcaccccag acttgcctgc ctgcagccct 4020 gccagtgaaa caggaagtga aaacagtgta aatatggctg cccacacatt aatgattctc 4080 tccagggcag ccatttctag gactacttca gcaactcctc tgaaagataa cacacaacag 4140 tttagagcat cttcaaggag caccacaaaa aagcggaaaa ttgaggaatt agatgaacgt 4200 gagcgaaact ctcgtccttc tagtaaaaat cttacaaatt catcaatacc aatgaaaaag 4260 aagaaaatta agaaaaagaa gcttcccagt tcatttccag caggaatgga tgtagacaaa 4320 tttttgttat cattgcatta tgatgagtaa acattctgga cacataagac agtaggtgtt 4380 aaaaactcat atcccttaaa ctgagtgtag ggaatgggat attgacagaa tctgaaagca 4440 tgacctgcac tttcattgta ctgaaacttc actttatatc taaatcatgc tgtttctgaa 4500 acagcttcct agtttgtaaa tagacttact tggtatattt ttattttggg aaaaacgttt 4560 gcagaaatgt aagtaaagcc aatctgcaaa actgtatagc tttgacaatt ccatattgta 4620 aatactgtgt aaatcttgtt gaaatagagg ttaaatcaac ctaatgttct tacactgtgt 4680 tttttgactt cttatgatcc ctaattctcg agacttactc atctggaata gtttctcact 4740 tgttttggag gaaaatgatg cttattctta taaattacct tcagtaacta ttcaataaga 4800 catttaaata cacaatcact aagtactaca aaataataac cacctcaact gatagcaaaa 4860 tatctaatta gaaatcaaca gtttgatgag tttttttcct agagtgaatt actacccctt 4920 tcttataatt gctgtaagta ttatatttct ggaatttcac actttaccta ctctgatcac 4980 tctgttctct ttgttttaaa gagagaattt tgtaaaccat ttattgaatg ttctgatctt 5040 tcatttataa cttaagattt tttaacagaa ctttaagtgg ttagactatt gagccataaa 5100 attttggttt gtagtagaga tttctcggta tgtacttttc ccctaaatgc tggggattta 5160 aagaattgca ctaattcata gaactcttta cattgtagtg accaatgcac atataattta 5220 aatcttgtgt tctatgtgag agtttatgtg gaggattatc ttctggtgtt gtgtccatca 5280 taagatgtaa aaagagaaat catagtttta tattttagtt tctcctacaa tgggtatgtt 5340 aaataactat atctatactt agatatagtg cattgtctgt gtcccttaaa gtgttacaaa 5400 gagtttctct caaaaggtcc ttaaggagag tgagagatga gagaggtgcc ttctctgtat 5460 tagactcaca cacatcagtc cagttgaaga attggtttaa acttttaaat gagacagaaa 5520 ttataattag tacctgacca attgtgccta gatattaaag tatgttctct ttagtacttt 5580 ccgttcaatt aaatatcagt atattaggga tttttttcta caggtgtata ttttgttgaa 5640 gagtcacttt cctcaaccat ttttttgtac tgggccttta aaaaaataaa tccatccaac 5700 tttaacacaa catttctcag tgtagaaatc atgtcttctt aattgctgaa ccttactgca 5760 aaaacttgtg atgtaagaaa tttgtatggt gtggcagtgg tctattccta aggaactaaa 5820 tatcatatag ttaatgttta tttaactcag cttgagactg tactacagtt aggtttgaat 5880 aaatattttc attaacttca 5900 300 3859 DNA Homo sapiens 300 aatctatcag ggaacggcgg tggccggtgc ggcgtgttcg gtgcgctctg gccgctcagg 60 ccgtgcggct gggtgagcgc acgcgaggcg gcgaggcggc aagcgtgttt ctaggtcgtg 120 gcgtcgggct tccggagctt tggcggcagc taggggagga tggcggagtc ttcggataag 180 ctctatcgag tcgagtacgc caagagcggg cgcgcctctt gcaagaaatg cagcgagagc 240 atccccaagg actcgctccg gatggccatc atggtgcagt cgcccatgtt tgatggaaaa 300 gtcccacact ggtaccactt ctcctgcttc tggaaggtgg gccactccat ccggcaccct 360 gacgttgagg tggatgggtt ctctgagctt cggtgggatg accagcagaa agtcaagaag 420 acagcggaag ctggaggagt gacaggcaaa ggccaggatg gaattggtag caaggcagag 480 aagactctgg gtgactttgc agcagagtat gccaagtcca acagaagtac gtgcaagggg 540 tgtatggaga agatagaaaa gggccaggtg cgcctgtcca agaagatggt ggacccggag 600 aagccacagc taggcatgat tgaccgctgg taccatccag gctgctttgt caagaacagg 660 gaggagctgg gtttccggcc cgagtacagt gcgagtcagc tcaagggctt cagcctcctt 720 gctacagagg ataaagaagc cctgaagaag cagctcccag gagtcaagag tgaaggaaag 780 agaaaaggcg atgaggtgga tggagtggat gaagtggcga agaagaaatc taaaaaagaa 840 aaagacaagg atagtaagct tgaaaaagcc ctaaaggctc agaacgacct gatctggaac 900 atcaaggacg agctaaagaa agtgtgttca actaatgacc tgaaggagct actcatcttc 960 aacaagcagc aagtgccttc tggggagtcg gcgatcttgg accgagtagc tgatggcatg 1020 gtgttcggtg ccctccttcc ctgcgaggaa tgctcgggtc agctggtctt caagagcgat 1080 gcctattact gcactgggga cgtcactgcc tggaccaagt gtatggtcaa gacacagaca 1140 cccaaccgga aggagtgggt aaccccaaag gaattccgag aaatctctta cctcaagaaa 1200 ttgaaggtta aaaagcagga ccgtatattc cccccagaaa ccagcgcctc cgtggcggcc 1260 acgcctccgc cctccacagc ctcggctcct gctgctgtga actcctctgc ttcagcagat 1320 aagccattat ccaacatgaa gatcctgact ctcgggaagc tgtcccggaa caaggatgaa 1380 gtgaaggcca tgattgagaa actcgggggg aagttgacgg ggacggccaa caaggcttcc 1440 ctgtgcatca gcaccaaaaa ggaggtggaa aagatgaata agaagatgga ggaagtaaag 1500 gaagccaaca tccgagttgt gtctgaggac ttcctccagg acgtctccgc ctccaccaag 1560 agccttcagg agttgttctt agcgcacatc ttgtcccctt ggggggcaga ggtgaaggca 1620 gagcctgttg aagttgtggc cccaagaggg aagtcagggg ctgcgctctc caaaaaaagc 1680 aagggccagg tcaaggagga aggtatcaac aaatctgaaa agagaatgaa attaactctt 1740 aaaggaggag cagctgtgga tcctgattct ggactggaac actctgcgca tgtcctggag 1800 aaaggtggga aggtcttcag tgccaccctt ggcctggtgg acatcgttaa aggaaccaac 1860 tcctactaca agctgcagct tctggaggac gacaaggaaa acaggtattg gatattcagg 1920 tcctggggcc gtgtgggtac ggtgatcggt agcaacaaac tggaacagat gccgtccaag 1980 gaggatgcca ttgagcagtt catgaaatta tatgaagaaa aaaccgggaa cgcttggcac 2040 tccaaaaatt tcacgaagta tcccaaaaag ttttaccccc tggagattga ctatggccag 2100 gatgaagagg cagtgaagaa gctcacagta aatcctggca ccaagtccaa gctccccaag 2160 ccagttcagg acctcatcaa gatgatcttt gatgtggaaa gtatgaagaa agccatggtg 2220 gagtatgaga tcgaccttca gaagatgccc ttggggaagc tgagcaaaag gcagatccag 2280 gccgcatact ccatcctcag tgaggtccag caggcggtgt ctcagggcag cagcgactct 2340 cagatcctgg atctctcaaa tcgcttttac accctgatcc cccacgactt tgggatgaag 2400 aagcctccgc tcctgaacaa tgcagacagt gtgcaggcca aggtggaaat gcttgacaac 2460 ctgctggaca tcgaggtggc ctacagtctg ctcaggggag ggtctgatga tagcagcaag 2520 gatcccatcg atgtcaacta tgagaagctc aaaactgaca ttaaggtggt tgacagagat 2580 tctgaagaag ccgagatcat caggaagtat gttaagaaca ctcatgcaac cacacacagt 2640 gcgtatgact tggaagtcat cgatatcttt aagatagagc gtgaaggcga atgccagcgt 2700 tacaagccct ttaagcagct tcataaccga agattgctgt ggcacgggtc caggaccacc 2760 aactttgctg ggatcctgtc ccagggtctt cggatagccc cgcctgaagc gcccgtgaca 2820 ggctacatgt ttggtaaagg gatctatttc gctgacatgg tctccaagag tgccaactac 2880 taccatacgt ctcagggaga cccaataggc ttaatcctgt tgggagaagt tgcccttgga 2940 aacatgtatg aactgaagca cgcttcacat atcagcaggt tacccaaggg caagcacagt 3000 gtcaaaggtt tgggcaaaac tacccctgat ccttcagcta acattagtct ggatggtgta 3060 gacgttcctc ttgggaccgg gatttcatct ggtgtgatag acacctctct actatataac 3120 gagtacattg tctatgatat tgctcaggta aatctgaagt atctgctgaa actgaaattc 3180 aattttaaga cctccctgtg gtaattggga gaggtagccg agtcacaccc ggtggctgtg 3240 gtatgaattc acccgaagcg cttctgcacc aactcacctg gccgctaagt tgctgatggg 3300 tagtacctgt actaaaccac ctcagaaagg attttacaga aacgtgttaa aggttttctc 3360 taacttctca agtcccttgt tttgtgttgt gtctgtgggg aggggttgtt ttggggttgt 3420 ttttgttttt tcttgccagg tagataaaac tgacatagag aaaaggctgg agagagattc 3480 tgttgcatag actagtccta tggaaaaaac caaagcttcg ttagaatgtc tgccttactg 3540 gtttccccag ggaaggaaaa atacacttcc accctttttt ctaagtgttc gtctttagtt 3600 ttgattttgg aaagatgtta agcatttatt tttagttaaa ataaaaacta atttcatact 3660 atttagattt tcttttttat cttgcactta ttgtcccctt tttagttttt tttgtttgcc 3720 tcttgtggtg aggggtgtgg gaagaccaaa ggaaggaacg ctaacaattt ctcatactta 3780 gaaacaaaaa gagctttcct tctccaggaa tactgaacat gggagctctt gaaatatgta 3840 gtattaaaag ttgcatttg 3859 301 1631 DNA Homo sapiens 301 ggcaggcatg ggagccgcgc gctctctccc ggcgcccaca cctgtctgag cggcgcagcg 60 agccgcggcc cgggcgggct gctcggcgcg gaacagtgct cggcatggca gggattccag 120 ggctcctctt ccttctcttc tttctgctct gtgctgttgg gcaagtgagc ccttacagtg 180 ccccctggaa acccacttgg cctgcatacc gcctccctgt cgtcttgccc cagtctaccc 240 tcaatttagc caagccagac tttggagccg aagccaaatt agaagtatct tcttcatgtg 300 gaccccagtg tcataaggga actccactgc ccacttacga agaggccaag caatatctgt 360 cttatgaaac gctctatgcc aatggcagcc gcacagagac gcaggtgggc atctacatcc 420 tcagcagtag tggagatggg gcccaacacc gagactcagg gtcttcagga aagtctcgaa 480 ggaagcggca gatttatggc tatgacagca ggttcagcat ttttgggaag gacttcctgc 540 tcaactaccc tttctcaaca tcagtgaagt tatccacggg ctgcaccggc accctggtgg 600 cagagaagca tgtcctcaca gctgcccact gcatacacga tggaaaaacc tatgtgaaag 660 gaacccagaa gcttcgagtg ggcttcctaa agcccaagtt taaagatggt ggtcgagggg 720 ccaacgactc cacttcagcc atgcccgagc agatgaaatt tcagtggatc cgggtgaaac 780 gcacccatgt gcccaagggt tggatcaagg gcaatgccaa tgacatcggc atggattatg 840 attatgccct cctggaactc aaaaagcccc acaagagaaa atttatgaag attggggtga 900 gccctcctgc taagcagctg ccagggggca gaattcactt ctctggttat gacaatgacc 960 gaccaggcaa tttggtgtat cgcttctgtg acgtcaaaga cgagacctat gacttgctct 1020 accagcaatg cgatgcccag ccaggggcca gcgggtctgg ggtctatgtg aggatgtgga 1080 agagacagca gcagaagtgg gagcgaaaaa ttattggcat tttttcaggg caccagtggg 1140 tggacatgaa tggttcccca caggatttca acgtggctgt cagaatcact cctctcaaat 1200 atgcccagat ttgctattgg attaaaggaa actacctgga ttgtagggag gggtgacaca 1260 gtgttccctc ctggcagcaa ttaagggtct tcatgttctt attttaggag aggccaaatt 1320 gttttttgtc attggcgtgc acacgtgtgt gtgtgtgtgt gtgtgtgtgt aaggtgtctt 1380 ataatctttt acctatttct tacaattgca agatgactgg ctttactatt tgaaaactgg 1440 tttgtgtatc atatcatata tcatttaagc agtttgaagg catacttttg catagaaata 1500 aaaaaaatac tgatttgggg caatgaggaa tatttgacaa ttaagttaat cttcacgttt 1560 ttgcaaactt tgatttttat ttcatctgaa cttgtttcaa agatttatat taaatatttg 1620 gcatacaaga g 1631 302 5009 DNA Homo sapiens 302 ggcacgagga gaagcggccg cggcggtaga ggcggcagag acggtttctc catcttcccc 60 cctccccttc cccccttaga gtttccctcc ctccctccct ccttcccagt ctgggcaccc 120 aggcctgtga ccgcttcgtg ggcgcaaagg aggttgccag tccgctcttg cgggcctgtg 180 cccgcgccgt tccggggcct ccgtgctggg tgcgaggctc acgccgggag cgacgcacag 240 gctgagtggc tggagctttt gcgttcagtt cggtggcatt aagtacactc acgtccttgt 300 gcagccttcg ccaccatcct ccttcagaac tcattcatct ctcacagttg aacctttgca 360 cacatgaaac agtttgtagt gcatccctga gcatgagtgc agaatgaagc gacgtttgga 420 tgaccaggaa tcaccagtgt atgcagccca gcagcgaagg attcctggga gcacagaggc 480 tttttctcac cagcaccggg tccttgcccc ggcccctcct gtgtatgaag cagtgtctga 540 gaccatgcag tcagctacag gcattcagta ctcagtggca cccaactacc aggtttcagc 600 tgtgccacaa agttctggca gtcatgggcc cgccatagca gcagttcata gcagccatca 660 tcacccaaca gctgtccagc ctcatggagg ccaggtggtc cagagccatg cccacccagc 720 accaccagtt gcaccagtac agggacagca gcagtttcag aggctcaagg tggaagacgc 780 cctgtcctat cttgaccagg tgaaactgca gttcggtagt cagcctcagg tctacaatga 840 tttccttgac atcatgaagg aatttaaatc tcagagcatt gatactccag gagtgattag 900 ccgagtgtcc cagctattta aaggccaccc tgatctgatc atgggcttta acaccttctt 960 gcctcctggc tacaaaattg aggtgcagac taatgacatg gtgaacgtga caacacctgg 1020 ccaagttcat cagattccca cccatggcat ccagccccag cctcagccac cacctcagca 1080 tccttcccag ccttcatccc agtcagctcc cactcctgct cagccagctc ctcagcccac 1140 agctgccaaa gtcagcaagc cttcccaact acaagcacat actccagcca gtcagcagac 1200 tcccccactc ccaccatatg catccccacg ttctccacca gtccagcctc acacaccagt 1260 gacaatctcc ttggggacag ctccatcttt gcaaaacaat cagcctgtgg agtttaatca 1320 tgccatcaac tatgttaata agatcaagaa cagattccag ggccaaccag acatctacaa 1380 agcattcttg gagattttgc acacatacca gaaagaacag cggaatgcca aggaagctgg 1440 aggaaactac actccagctt tgactgagca agaggtgtat gcccaggtgg ctcgactctt 1500 caaaaaccag gaagatttgt tgtctgaatt tggacagttc ctgccagatg ccaacagctc 1560 agtgctttta agcaaaacaa ctgctgagaa ggttgattct gtgagaaatg accatggagg 1620 cactgtgaag aagccccaac tgaataacaa gccacagagg cccagtcaga atggctgcca 1680 gatccgcagg cactctggaa caggagccac acctccagtg aagaaaaaac ccaaactgat 1740 gagtctaaaa gagtcttcaa tggcagatgc cagcaagcat ggtgttggaa cggaatcatt 1800 attttttgat aaggttcgaa aggctcttcg gagtgcagag gcctatgaaa acttccttcg 1860 ttgccttgtt atctttaatc aggaggtgat ctctcgggcc gagcttgtac agctagtctc 1920 tccttttctg gggaaattcc ctgaattgtt taattggttt aaaaactttt tgggctataa 1980 ggagtctgta catctggaaa gctttccaaa ggaacgagct acagaaggca ttgccatgga 2040 gatagactat gcctcttgta aacgactggg ctctagctac cgagccctac cgaaaagtta 2100 ccagcagccc aagtgcacgg gacggactcc tctgtgtaaa gaggttttaa atgatacctg 2160 ggtttccttc ccatcttggt ctgaagactc cacttttgtt agttccaaga agactcagta 2220 tgaagaacat atttaccgtt gtgaagatga acgatttgag cttgatgtgg ttcttgagac 2280 caatcttgca acaatccggg ttttagaagc aatacagaaa aaactttctc gcttgtctgc 2340 tgaggaacaa gccaaatttc gcttggataa cacccttgga ggcacgtccg aagtcatcca 2400 tcgaaaagca ctccagagga tatatgctga caaagcagct gatatcatcg atggcctgag 2460 gaagaacccc tccattgctg ttccgattgt ccttaaaagg ttgaagatga aagaagaaga 2520 gtggcgagaa gctcagagag gcttcaacaa ggtctggcga gagcaaaatg agaagtacta 2580 cttgaagtct ctggatcacc aaggcatcaa cttcaagcag aacgacacta aggtcttgag 2640 gtctaagagc ttactcaatg agatcgagag catctatgac gagaggcaag agcaggctac 2700 agaagagaac gctggtgtac ctgttggccc gcacctctct cttgcctatg aagacaaaca 2760 gatactagaa gatgctgctg ctctgattat ccaccatgtg aagaggcaaa caggcattca 2820 gaaagaggac aaatacaaaa tcaagcaaat catgcaccat ttcattcctg acctgctgtt 2880 tgatcagaga ggcgatctct cagatgtgga agaagaggag gaggaagaaa tggatgtgga 2940 tgaagcaaca ggagcaccta agaagcacaa tggtgttggg ggcagccccc ctaagtccaa 3000 gttgctattt agtaacacag cagctcaaaa gttaagaggg atggatgaag tatataacct 3060 tttctatgtc aataacaatt ggtatatctt tatgcgactg catcaaattc tctgcttgag 3120 gctgctacgg atttgttccc aagctgaacg gcaaattgaa gaagaaaacc gagagagaga 3180 atgggaacgg gaggtgctag gcataaagcg agacaagagt gatagtcctg ccatacaact 3240 acgtctcaag gaacctatgg atgttgatgt agaagattat tacccagctt tcctggacat 3300 ggtgcggagc ctgcttgatg gcaacataga ctcatcacag tatgaagatt cattgagaga 3360 gatgttcacc attcatgcct acattgcctt tactatggac aaattaatcc agagcatcgt 3420 cagacagcta cagcacatcg tcagcgacga ggtctgtgtg caggttactg atctttactt 3480 ggcagaaaac aataacggag ccacgggagg ccagctcaac agtcagactt caaggagcct 3540 tctggagtca gcataccagc ggaaggcaga gcagcttatg tcagacgaga actgcttcaa 3600 gctaatgttc attcaaagtc aaggtcaagt tcagctgact gttgagctcc tggacacaga 3660 agaggagaac tcagatgacc ccgtggaagc agagcgttgg tcagactacg tggagcgata 3720 tatgagttct gatactactt ctcctgaact tcgagaacat ctggcacaga aaccagtatt 3780 tctcccaagg aatttgcggc gtatccggaa gtgtcaacgt ggtcgagagc aacaggaaaa 3840 agaagggaaa gaaggaaaca gcaagaagac catggaaaat gtagagagcc tggataagct 3900 ggagtgtagg ttcaagctga actcctataa gatggtatat gtgatcaaat cggaggacta 3960 catgtaccgg agaactgctc tactcagagc tcatcagtcc catgagcgtg taagcaagcg 4020 tctgcatcag cggttccagg cctgggtgga taaatggacc aaggagcatg tgcctcggga 4080 aatggcagca gagaccagca aatggctcat gggtgagggg ctcgagggcc tggtaccctg 4140 caccaccacc tgtgatacag agactctgca ctttgtgagc attaacaaat atcgtgtcaa 4200 atacggcaca gtattcaaag ccccttaact gcagagccag agcaggtagc tcagaggtgt 4260 gtgtgtgtgt atgtgtgtga atatgtgtgt gtatgtgtgt gtgtgggggc atgtgtactc 4320 atgtacactg aagaatcaag gaagatgcct ttcaagccac cagtgcaacc aggcaccagc 4380 aggatacacg aaaggaatgc gaatggatgg ataccttagg aagctggaac tggcactttt 4440 cccacgggta ctacccagac ccaacaggga atgaagtcca tgcaagcaca gagacaggca 4500 gcttgcttgc acacaccagc agagctcaac tacaacttcc tatgacctga gcaaccagat 4560 gctgtcccca tttggatgga tggatggagg tgcatcttgt cactgctcct tttcccacct 4620 ccaaaaaaga gaggggaggg gagccaggat ttgatgatac cctgacttga gcacagctaa 4680 gatctctgtt agattttatc tgagacctgt ttggctttgg aattaataac ttatggattt 4740 ggtttctgag tccttgaagt tcaccttctt ccctcttctg gttctcggcc tcaacaaatt 4800 ccagtcagtc ctccttttta gctcccatgg gactgttttg tacctgcttc tttactagtc 4860 tgccctagtg ccatccacca gcttctctct cacacgcaca catgcacaca catgcacaca 4920 cacacaattt taacttggtt tttctttttt gtaaataatg tacatactgt caatttttta 4980 ttaaagaaat atgctttgat gtgctagca 5009 303 1622 DNA Homo sapiens 303 agactgtcca ttgcatggct gtggagtgag actgccctta gcctgggtca gccttcctgg 60 gccataaatt gggcatccgt gatgctaggt aactgtggga acaaaatgac agcttagagc 120 agccatgggt gatgtttggt ggtaaaaaac ctacaggcgt ttggggtccc atgattgttc 180 cagaccatga ctcttcctgg ttgtgggttt gttacagagc aggagaagca gaggttatga 240 cagttatgca gactttcccc ctcctttttc tcttttctct tccccttgct tttccactgt 300 ttcttcctgc tgccacctgg gccttgaatt cctgggctgt gaagacatgt agcagctgca 360 gggtttacca cacgtgggag ggcagcccag tactgtccct ctgccttccc cactttgaga 420 atatggcagc ccctttcatt cctggcttgg ggtaggggag accattgaag tagaagcctc 480 aaagcagact tttcccttta ctgtgtgtac tccaggacga agaaggaaga tcatgcttga 540 tacttagatt ggttttccca gggaagaggg cggagcagag caaagtcact gtgaaccctg 600 ggccaggccc tggctgggcc agctcctgag agcgtctcgt gttgcagacc cttgcccact 660 tcacccacct gcaccttctc cccctctcac agtgtcactg ctgctaatgg tcaaagtcaa 720 atgtgtggcc acatgggatg ggccaggtcc tctcaggcta ctttctggat gtcattttta 780 aaatatggaa acatgcaggt gccttcccaa agaggcttgg actggtatat ccaacgagaa 840 acaaataagc taaagaaagt ttaaactcaa gaagaaagat gttgacagtc tatgtaacag 900 ctggaaagtt tataggcacc cacctttggg acaacccagt gattatgaac atgtgatatc 960 tactatttaa aagaaatgtt ctcaccttgg gttgattgtg gtataccatg tgttatgaaa 1020 attgttgagc tgaagctttg aatcgattta gttgagtctg actcacttgc tttggttcct 1080 gtgtatttta ctacccctct tgtcagtgac cttccttccc caccccaccc agagtgaatt 1140 tgtagcatga ttgtataaac ctctatgtag aaaatggaga tttcttgctc tgaaatgtta 1200 agctctaact gatccatttc tgtgtccttt agcctagtat gtctgaactt ccattcttgt 1260 tatatattta aactttccct ctatattata ggttttgtgg catccacggt caggtgtaga 1320 ggaagctgcc ccttgcagaa ctgtactgta atatttttct tttataaata ttttcacagg 1380 actgattgta cacagggctt gtaataaaat tctaacactg tgctgtgaaa caactatggg 1440 gaatctccat tgaaggctac ttcatgggca cctgaaagtg gagtgttata gctatgactt 1500 tctatttctt gtttcctaag taaattaaac ctaattttca ccctttcatt ctgtttcagc 1560 ctcctgtata agaagtaccg tattttctgc ccatcatact ttgtaataaa acttgaacat 1620 gt 1622 304 1610 DNA Homo sapiens 304 gtcgacttgt gagcaaatct tggtggtggt ggtggtgggg cggcctgaga agatattatg 60 gctgctgcca cggagcataa tcgcccgagc agcggtgaca ggaacctgga gcgaagatgc 120 agccccaacc tctcccgaga ggtgctctac gaaatctttc gctccctaca caccctggtt 180 ggacagcttg acctcagaga tgatgtggtg aaaattacaa tcgattggaa caagctccag 240 agcctctcgg cattccagcc tgcattgctc tttagtgcac ttgaacaaca cattttatat 300 ttacagcctt ttttagcaaa acttcagtct ccgattaaag aggagaatac aactgctgtt 360 gaagagatag gaagaacaga aatggggaac aaaaatgaag taaatgacaa attttccatt 420 ggcgacctac aagaggaaga aaagcacaaa gaaagtgatt taagagatgt gaaaaagaca 480 cagatccatt ttgatccaga agtagttcag ataaaggctg gaaaagcaga aattgacaga 540 cgaatatctg catttattga aagaaagcaa gctgaaatca atgaaaacaa cgtcagggaa 600 ttttgcaatg ttattgattg taatcaagaa aatagttgtg caagaactga tgcgattttt 660 accccttacc ccggatttaa aagtcacgta aaagtttcta gagttgtgaa tacatacgga 720 ccacagacta gacctgaagg aattccaggg tcaggtcata aacctaacag catgcttcga 780 gactgtggta atcaggctgt agaagaacga ctacaaaata ttgaggccca cttgcggtta 840 cagacaggtg gtccagtgcc aagagacatt tatcagagaa ttaaaaaact tgaggataaa 900 atccttgaat tggaaggcat ctctcctgaa tattttcagt ctgtaagctt ttctggaaaa 960 agaagaaaag ttcaaccacc tcaacagaac tattcactgg ctgaacttga tgagaaaatt 1020 agtgccctca aacaagccct cctcagaaaa tcaagagaag cagaatccat ggcaacccac 1080 caccttccat gaaaaactcc tctcgtcctt gtcattttac ttttttgtat acatatgtgt 1140 aacttacact gttattaatt caacggatat atatttatca atgtagactt ctctgtgaag 1200 tcaaggttta ttttcacatg agtccagctt gcagcaagta atcttcaaat attccagtaa 1260 gttttgaaat aatgaatgca ttgaaaatag cattgttgga atttattccc ttctgtttta 1320 aaggaggtat tttggaaatt gaaaaactta atttacaagt gctggtacat cttttttctt 1380 ttgtggacca ccggaaggaa gatttatttt gatattgata ctttaaacgg actttatatt 1440 tgtaacattg atgttaacca gtttattagt ttaaatcagt actttgcata taataagcat 1500 tcaacaaaaa tttattgact gaaatgaatt ttaaatgcat gtgtatgaac ttgtttttct 1560 taatctttaa gatgtcagta ttcatgtaaa taaacaatag atccagcttt 1610 305 5864 DNA Homo sapiens 305 ccaggagagc ggcgtggacg cgtgcgggcc tagaggccca cgtgatccgc agggcggccg 60 aggcaggaag cttgtgagtg cgcggttgcg gggtcgcatt gtggctacgg ctttgcgtcc 120 ccggcgggca gccccaggct ggtccccgcc tccgctctcc ccaccggcgg ggaaagcagc 180 tggtgtggga ggaaaggctc catcccccgc cccctctctc ccgctgttgg ctggcaggat 240 cttttggcag tcctgtggcc tcgctccccg cccggatcct cctgaccctg agattcgcgg 300 gtctcacgtc ccgtgcacgc cttgcttcgg cctcagttaa gcctttgtgg actccaggtc 360 cctggtgaga ttagaaacgt ttgcaaacat gtcccggatc gaaaagatga gcattctggg 420 cgtgcggagt tttggaatag aggacaaaga taagcaaatt atcactttct tcagccccct 480 tacaattttg gttggaccca atggggcggg aaagacgacc atcattgaat gtctaaaata 540 tatttgtact ggagatttcc ctcctggaac caaaggaaat acatttgtac acgatcccaa 600 ggttgctcaa gaaacagatg tgagagccca gattcgtctg caatttcgtg atgtcaatgg 660 agaacttata gctgtgcaaa gatctatggt gtgtactcag aaaagcaaaa agacagaatt 720 taaaactctg gaaggagtca ttactagaac aaagcatggt gaaaaggtca gtctgagctc 780 taagtgtgca gaaattgacc gagaaatgat cagttctctt ggggtttcca aggctgtgct 840 aaataatgtc attttctgtc atcaagaaga ttctaattgg cctttaagtg aaggaaaggc 900 tttgaagcaa aagtttgatg agattttttc agcaacaaga tacattaaag ccttagaaac 960 acttcggcag gtacgtcaga cacaaggtca gaaagtaaaa gaatatcaaa tggaactaaa 1020 atatctgaag caatataagg aaaaagcttg tgagattcgt gatcagatta caagtaagga 1080 agcccagtta acatcttcaa aggaaattgt caaatcctat gagaatgaac ttgatccatt 1140 gaagaatcgt ctaaaagaaa ttgaacataa tctctctaaa ataatgaaac ttgacaatga 1200 aattaaagcc ttggatagcc gaaagaagca aatggagaaa gataatagtg aactggaaga 1260 gaaaatggaa aaggtttttc aagggactga tgagcaacta aatgacttat atcacaatca 1320 ccagagaaca gtaagggaga aagaaaggaa attggtagac tgtcatcgtg aactggaaaa 1380 actaaataaa gaatctaggc ttctcaatca ggaaaaatca gaactgcttg ttgaacaggg 1440 tcgtctacag ctgcaagcag atcgccatca agaacatatc cgagctagag attcattaat 1500 tcagtctttg gcaacacagc tagaattgga tggctttgag cgtggaccat tcagtgaaag 1560 acagattaaa aattttcaca aacttgtgag agagagacaa gaaggggaag caaaaactgc 1620 caaccaactg atgaatgact ttgcagaaaa agagactctg aaacaaaaac agatagatga 1680 gataagagat aagaaaactg gactgggaag aataattgag ttaaaatcag aaatcctaag 1740 taagaagcag aatgagctga aaaatgtgaa gtatgaatta cagcagttgg aaggatcttc 1800 agacaggatt cttgaactgg accaggagct cataaaagct gaacgtgagt taagcaaggc 1860 tgagaaaaac agcaatgtag aaaccttaaa aatggaagta ataagtctcc aaaatgaaaa 1920 agcagactta gacaggaccc tgcgtaaact tgaccaggag atggagcagt taaaccatca 1980 tacaacaaca cgtacccaaa tggagatgct gaccaaagac aaagctgaca aagatgaaca 2040 aatcagaaaa ataaaatcta ggcacagtga tgaattaacc tcactgttgg gatattttcc 2100 caacaaaaaa cagcttgaag actggctaca tagtaaatca aaagaaatta atcagaccag 2160 ggacagactt gccaaattga acaaggaact agcttcatct gagcagaata aaaatcatat 2220 aaataatgaa ctaaaaagaa aggaagagca gttgtccagt tacgaagaca agctgtttga 2280 tgtttgtggt agccaggatt ttgaaagtga tttagacagg cttaaagagg aaattgaaaa 2340 atcatcaaaa cagcgagcca tgctggctgg agccacagca gtttactccc agttcattac 2400 tcagctaaca gacgaaaacc agtcatgttg ccccgtttgt cagagagttt ttcagacaga 2460 ggctgagtta caagaagtca tcagtgattt gcagtctaaa ctgcgacttg ctccagataa 2520 actcaagtca acagaatcag agctaaaaaa aaaggaaaag cggcgtgatg aaatgctggg 2580 acttgtgccc atgaggcaaa gcataattga tttgaaggag aaggaaatac cagaattaag 2640 aaacaaactg cagaatgtca atagagacat acagcgccta aagaacgaca tagaagaaca 2700 agaaacactc ttgggtacaa taatgcctga agaagaaagt gccaaagtat gcctgacaga 2760 tgttacaatt atggagaggt tccagatgga acttaaagat gttgaaagaa aaattgcaca 2820 acaagcagct aagctacaag gaatagactt agatcgaact gtccaacaag tcaaccagga 2880 gaaacaagag aaacagcaca agttagacac agtttctagt aagattgaat tgaatcgtaa 2940 gcttatacag gaccagcagg aacagattca acatctaaaa agtacaacaa atgagctaaa 3000 atctgagaaa cttcagatat ccactaattt gcaacgtcgt cagcaactgg aggagcagac 3060 tgtggaatta tccactgaag ttcagtcttt gtacagagag ataaaggatg ctaaagagca 3120 ggtaagccct ttggaaacaa cattggaaaa gttccagcaa gaaaaagaag aattaatcaa 3180 caaaaaaaat acaagcaaca aaatagcaca ggataaactg aatgatatta aagagaaggt 3240 taaaaatatt catggctata tgaaagacat tgagaattat attcaagatg ggaaagacga 3300 ctataagaag caaaaagaaa ctgaacttaa taaagtaata gctcaactaa gtgaatgcga 3360 gaaacacaaa gaaaagataa atgaagatat gagactcatg agacaagata ttgatacaca 3420 gaagatacaa gaaaggtggc tacaagataa ccttacttta agaaaaagaa atgaggaact 3480 aaaagaagtt gaagaagaaa gaaaacaaca tttgaaggaa atgggtcaaa tgcaggtttt 3540 gcaaatgaaa agtgaacatc agaagttgga agagaacata gacaatataa aaagaaatca 3600 taatttggca ttagggcgac agaaaggtta tgaagaagaa attattcatt ttaagaaaga 3660 acttcgagaa ccacaatttc gggatgctga ggaaaagtat agagaaatga tgattgttat 3720 gaggacaaca gaacttgtga acaaggatct ggatatttat tataagactc ttgaccaagc 3780 aataatgaaa tttcacagta tgaaaatgga agaaatcaat aaaattatac gtgacctgtg 3840 gcgaagtacc tatcgtggac aagatattga atacatagaa atacggtctg atgccgatga 3900 aaatgtatca gcttctgata aaaggcggaa ttataactac cgagtggtga tgctgaaggg 3960 agacacagcc ttggatatgc gaggacgatg cagtgctgga caaaaggtat tagcctcact 4020 catcattcgc ctggccctgg ctgaaacgtt ctgcctcaac tgtggcatca ttgccttgga 4080 tgagccaaca acaaatcttg accgagaaaa cattgaatct cttgcacatg ctctggttga 4140 gataataaaa agtcgctcac agcagcgtaa cttccagctt ctggtaatca ctcatgatga 4200 agattttgtg gagcttttag gacgttctga atatgtggag aaattctaca ggattaaaaa 4260 gaacatcgat cagtgctcag agattgtgaa atgcagtgtt agctccctgg gattcaatgt 4320 tcattaaaaa tatccaagat ttaaatgcca tagaaatgta ggtcctcaga aagtgtataa 4380 taagaaactt atttctcata tcaacttagt caataagaaa atatattctt tcaaaggaac 4440 attgtgtcta ggattttgga tgttgagagg ttctaaaatc atgaaacttg tttcactgaa 4500 aattggacag attgcctgtt tctgatttgc tgctcttcat cccattccag gcagcctctg 4560 tcaggccttc agggttcagc agtacagccg agactcgact ctgtgcctcc ctccccagtg 4620 caaatgcatg cttcttctca aagcactgtt gagaaggaga taattactgc cttgaaaatt 4680 tatggttttg gtattttttt aaatcatagt taaatgttac ctctgaattt acttccttgc 4740 atgtggtttg aaaaactgag tattaatatc tgaggatgac cagaaatggt gagatgtatg 4800 tttggctctg cttttaactt tataaatcca gtgacctctc tctctgggac ttggtttccc 4860 caactaaaat ttgaagtagt tgaatggggt ctcaaagttt gacaggaacc ttaagtaatc 4920 atctaagtca gtacccacca ccttcttctc ctacatatcc cttccagatg gtcatccaga 4980 ctcagagctc tctctacaga gaggaaattc tccactgtgc acacccacct ttggaaagct 5040 ctgaccactt gaggcctgat ctgcccatcg tgaagaagcc tgtaacactc ctctgcgtct 5100 atcctgtgta gcatactggc ttcaccatca atcctgattc ctctctaagt gggcattgcc 5160 atgtggaagg caagccaggc tcactcacag agtcaaggcc tgctccctgt agggtccaac 5220 cagacctgga agaacaggcc tctccatttg ctcttcagat gccacttcta agaaaagcct 5280 aatcacagtt tttcctggaa ttgccagctg acatcttgaa tccttccatt ccacacagaa 5340 tgcaaccaag tcacacgctt ttgaattatg ctttgtagag ttttgtcatt cagagtcagc 5400 caggaccata ccgggtcttg attcagtcac atggcatggt tttgtgccat ctgtagctat 5460 aatgagcatg tttgcctaga cagcttttct caactgggtc cagaagagaa ttaagcccta 5520 aggtcctaag gcatctatct gtgctaggtt aaatggttgg cccccaaaga tagacaggtc 5580 ctgatttcta gaacccgtga ctgttacttt atacagcaaa ggaaactttg cagatgtgat 5640 taaagctaag gaccttaaga cagagtatcc tgggggtggt ggtggggtgg gggggggtcc 5700 taaatgtaat cacgagtaag attaagagca aatcaattct agtcatatat taaacatcca 5760 caataaccaa gatattttta tcccaagaat gcaagatttc agaaaatgaa aaatctgttg 5820 ataaatccat cactataata aaaccgaagg tgaaaaaaat tctg 5864 306 3231 DNA Homo sapiens 306 ggagatatta cggcggcaga tccgcctact gcagggtctg attgatgact acaaaaccct 60 ccacggcaat gccccggccc ctggtacccc agcagcttct gggtggcagc cacccactta 120 ccacagtggc agagccttta gtgcccgcta ccctcgtcca agccggaggg gctactcttc 180 ccaccatggg ccttcgtggc gcaagaaata ctccctcgtg aatcggcccc cgggaccctc 240 agaccctcct gccgaccatg ctgtgcggcc gttgcacggg gcccgggggg gccagcctcc 300 tgtcccgcag cagcatgtcc ttgagagaca ggtccagctc agtcagggtc agaacgtggt 360 catcaaagtt aaaccgccat caaagtctgg ctctgccagt gcctcagggg cccagcgggg 420 ctctttggaa gaatttgagg acaccccctg gagtgaccaa aggccccggg aaggtgaagg 480 tgagccccct cggggacagc tgcagccctc gaggccaaca agagccaggg ggacctgcag 540 tgtggaagat cctcttctgg tctgccagaa ggagcctggt aagcccagga tggtgaagtc 600 agtgggcagt gtgggcgaca gcccccggga gccccgccgg acagtcagtg agagtgtgat 660 tgccgtcaag gcgagcttcc catcctccgc tctgccccca cgcactggcg tggccctggg 720 ccggaagctg ggttctcatt ccgtggccag ctgtgctcca cagctccttg gggacaggag 780 agtagatgct ggccacacag atcagccagt tccgtctggc tcagtggggg gccccgccag 840 accggcctca ggacccaggc aggcccggga ggcctcgctg gttgtgacct gtcgaactaa 900 caagttccgg aaaaacaact acaaatgggt ggctgcctcc tcgaagagtc cccgggttgc 960 tcggagggcc ctcagtccca gagtggctgc agagaatgtg tgcaaggcct ctgctggcat 1020 ggcaaacaag gtggagaagc cgcagctcat agctgaccca gagcccaagc ccaggaagcc 1080 agccacgtcc tccaagccag ggtctgcccc cagcaagtac aagtggaagg cctccagccc 1140 ctctgcctcc tcctcttcct ccttccgttg gcagtcggag gccggcagca aggaccatgc 1200 ctcccagctc tccccagtcc tgtctaggtc cccgtcgggg gacagaccag cagtaggaca 1260 cagtggcttg aagcccctct ctggggagac cccgctctcg gcttacaaag tgaagagccg 1320 caccaagatc atccggagac gcggcagcac aagccttcct ggagacaaga aaagcggcac 1380 ctcacctgcc gccaccgcca agagccacct cagcctccgg cggagacagg ccctcagggg 1440 gaagagcagc cctgtcctga agaagacccc caacaagggc ctggtacagg tcaccacgca 1500 ccgactatgt cgcctgccac cgagccgggc ccacctcccc accaaggaag cgtccagcct 1560 gcatgccgtg cggactgcac ccaccagcaa ggtgatcaag acccgctacc gcattgtcaa 1620 gaagacgccg gcctcgcctc tcagcgcccc gcccttcccc ctgtctctgc cctcctggcg 1680 ggcccggcgg ctctcactat ccaggtccct ggtgctgaac cgcctgcgtc cagttgccag 1740 cgggggtggg aaagcccaac cgggctcccc ttggtggcgg agcaaaggct accgctgcat 1800 cggaggggtc ctctacaaag tatctgccaa caagctctcc aagacctccg gccagcccag 1860 tgatgcgggc agcaggcccc tcctgcgcac aggccggctg gatcctgcag gcagctgtag 1920 ccgttccctg gccagccggg cagtgcagcg cagcctggcc atcatccggc aggcgcggca 1980 gcgcagggag aagaggaagg agtactgcat gtactacaac cgcttcggca ggtgcaaccg 2040 tggcgagcgc tgcccctaca tccacgatcc cgagaaggtg gccgtgtgca ccaggtttgt 2100 ccggggcacc tgcaagaaaa cggatgggac ctgccccttc tcccaccatg tgtccaagga 2160 gaagatgccg gtgtgctcct acttcctgaa gggcatctgc agcaacagca actgtcccta 2220 tagccacgtg tacgtgtccc gcaaggccga ggtctgcagc gacttcctca aaggctactg 2280 ccccctgggt gcaaagtgca agaagaaaca cacgctgctg tgccccgact ttgcccgcag 2340 gggggcgtgt ccccgcggcg cccagtgcca gctgctccac cgtacccaga aacgccacag 2400 tcggcgggca gccacgtccc ccgccccagg gcccagcgac gcaaccgcca ggagcagggt 2460 ctcggccagc cacgggccca ggaagccttc agcatcccag cgccccacca ggcagacgcc 2520 cagctcggct gccctcactg cggctgccgt ggctgcacct ccccactgcc caggggggtc 2580 agcctctccc tcatcctcga aggcttcctc ctcctcctcc tcctcctcat cccctcccgc 2640 ttccttggac cacgaggcac catctctcca ggaggctgcc ttagcagcag cgtgctccaa 2700 caggctctgc aagctgcctt ccttcatctc cctgcagtcc tcgccgagcc caggagccca 2760 gcccagggtc cgggccccta gggcccccct caccaaggac tcagggaagc ctctgcacat 2820 caaaccacgt ctgtgaggac cccagggacc ggcctgcacc tacctcagac cctcatcctt 2880 ggagaggaaa gaggctctgt ccaccactct accccacagg agggccgccc gccaccaagc 2940 ctcacctggg ggccacaggg acactgctct gcctgcctgg ccctcaacct tccatgacca 3000 gcgtgtgcgc agggcctggt cttcctcccc caagccaggc ccctgtcccc accccaccac 3060 cttccagggt gccaggcagg gctggcctcc aggcctgtcc ccgactgcca ttggcaacag 3120 tggccctgca gcccccagcc ctccccaccc aggttttcgg cccagtgaag aggccactgg 3180 ccaggcctcc caggcaggtg ttttatgttc agcaataaag gttctatccg t 3231 307 3316 DNA Homo sapiens 307 gatcacaggt ctatcaccct attaaccact cacgggagct ctccatgcat ttggtatttt 60 cgtctggggg gtatgcacgc gatagcattg cgagacgctg gagccggagc accctatgtc 120 gcagtatctg tctttgattc ctgcctcatc ctattattta tcgcacctac gttcaatatt 180 acaggcgaac atacttacta aagtgtgtta attaattaat gcttgtagga cataataata 240 acaattgaat gtctgcacag ccactttcca cacagacatc ataacaaaaa atttccacca 300 aaccccccct cccccgcttc tggccacagc acttaaacac atctctgcca aaccccaaaa 360 acaaagaacc ctaacaccag cctaaccaga tttcaaattt tatcttttgg cggtatgcac 420 ttttaacagt caccccccaa ctaacacatt attttcccct cccactccca tactactaat 480 ctcatcaata caacccccgc ccatcctacc cagcacacac acaccgctgc taaccccata 540 ccccgaacca accaaacccc aaagacaccc cccacagttt atgtagctta cctcctcaaa 600 gcaatacact gaaaatgttt agacgggctc acatcacccc ataaacaaat aggtttggtc 660 ctagcctttc tattagctct tagtaagatt acacatgcaa gcatccccgt tccagtgagt 720 tcaccctcta aatcaccacg atcaaaagga acaagcatca agcacgcagc aatgcagctc 780 aaaacgctta gcctagccac acccccacgg gaaacagcag tgattaacct ttagcaataa 840 acgaaagttt aactaagcta tactaacccc agggttggtc aatttcgtgc cagccaccgc 900 ggtcacacga ttaacccaag tcaatagaat ccggcgtaaa gagtgtttta gatcaccccc 960 tccccaataa agctaaaact cacctgagtt gtaaaaaact ccagttgaca caaaatagac 1020 tacgaaagtg gctttaacat atctgaacac agaatagcta agacccaaac tgggattaga 1080 taccccacta tgcttagccc taaacctcaa cagttaaatc aacaaaactg ctcgccagaa 1140 cactacgagc cacagcttaa aactcaaagg acctggcggt gcttcatatc cctctagagg 1200 agcctgttct gtaatcgata aaccccgatc aacctcacca cctcttgctc agcctatata 1260 ccgccatctt cagcaaaccc tgatgaaggc tacaaagtaa gcgcaagtac ccacgtaaag 1320 acgttaggtc aaggtgtagc ccatgaggtg gcaagaaatg ggctacattt tctaccccag 1380 aaaactacga tagcccttat gaaacttaag ggtcgaaggt ggatttagca gtaaactaag 1440 agtagagtgc ttagttgaac agggccctga agcgcgtaca caccgcccgt caccctcctc 1500 aagtatactt caaaggacat ttaactaaaa cccctacgca tttatataga ggagacaagt 1560 cgtaacatgg taagtgtact ggaaagtgca cttggacgaa ccagagtgta gcttaacaca 1620 aagcacccaa cttacactta ggagatttca acttaacttg accgctctga gctaaaccta 1680 gccccaaacc cactccacct tactaccaga caaccttagc caaaccattt acccaaataa 1740 agtataggcg atagaaattg aaacctggcg caatagatat agtaccgcaa gggaaagatg 1800 aaaaattata accaagcata atatagcaag gactaacccc tataccttct gcataatgaa 1860 ttaactagaa ataactttgc aaggagagcc aaagctaaga cccccgaaac cagacgagct 1920 acctaagaac agctaaaaga gcacacccgt ctatgtagca aaatagtggg aagatttata 1980 ggtagaggcg acaaacctac cgagcctggt gatagctggt tgtccaagat agaatcttag 2040 ttcaacttta aatttgccca cagaaccctc taaatcccct tgtaaattta actgttagtc 2100 caaagaggaa cagctctttg gacactagga aaaaaccttg tagagagagt aaaaaattta 2160 acacccatag taggcctaaa agcagccacc aattaagaaa gcgttcaagc tcaacaccca 2220 ctacctaaaa aatcccaaac atataactga actcctcaca cccaattgga ccaatctatc 2280 accctataga agaactaatg ttagtataag taacatgaaa acattctcct ccgcataagc 2340 ctgcgtcaga ttaaaacact gaactgacaa ttaacagccc aatatctaca atcaaccaac 2400 aagtcattat taccctcact gtcaacccaa cacaggcatg ctcataagga aaggttaaaa 2460 aaagtaaaag gaactcggca aatcttaccc cgcctgttta ccaaaaacat cacctctagc 2520 atcaccagta ttagaggcac cgcctgccca gtgacacatg tttaacggcc gcggtaccct 2580 aaccgtgcaa aggtagcata atcacttgtt ccttaaatag ggacctgtat gaatggctcc 2640 acgagggttc agctgtctct tacttttaac cagtgaaatt gacctgcccg tgaagaggcg 2700 ggcataacac agcaagacga gaagacccta tggagcttta atttattaat gcaaacagta 2760 cctaacaaac ccacaggtcc taaactacca aacctgcatt aaaaatttcg gttggggcga 2820 cctcggagca gaacccaacc tccgagcagt acatgctaag acttcaccag tcaaagcgaa 2880 ctactatact caattgatcc aataacttga ccaacggaac aagttaccct agggataaca 2940 gcgcaatcct attctagagt ccatatcaac aatagggttt acgacctcga tgttggatca 3000 ggacatcccg atggtgcagc cgctattaaa ggttcgtttg ttcaacgatt aaagtcctac 3060 gtgatctgag ttcagaccgg agtaatccag gtcggtttct atctaccttc aaattcctcc 3120 ctgtacgaaa ggacaagaga aataaggcct acttcacaaa gcgccttccc ccgtaaatga 3180 tatcatctca acttagtatt atacccacac ccacccaaga acagggtttg ttaagatggc 3240 agagcccggt aatcgcataa aacttaaaac tttacagtca gaggttcaat tcctcttctt 3300 aacaacatac ccatgg 3316 308 852 DNA Homo sapiens 308 tctctctctc tctctctctc tctggtgaac aggacccgtc gccatgggcc gtgtgatccg 60 tggacagagg aagggcgccg ggtctgtgtt ccgcgcgcac gtgaagcacc gtaaaggcgc 120 tgcgcgcctg cgcgccgtgg atttcgctga gcggcacggc tacatcaagg gcatcgtcaa 180 ggacatcatc cacgacccgg gccgcggcgc gcccctcgcc aaggtggtct tccgggatcc 240 gtatcggttt aagaagcgga cggagctgtt cattgccgcc gagggcattc acacgggcca 300 gtttgtgtat tgcggcaaga aggcccagct caacattggc aatgtgctcc ctgtgggcac 360 catgcctgag ggtacaatcg tgtgctgcct ggaggagaag cctggagacc gtggcaagct 420 ggcccgggca tcagggaact atgccaccgt tatctcccac aaccctgaga ccaagaagac 480 ccgtgtgaag ctgccctccg gctccaagaa ggttatctcc tcagccaaca gagctgtggt 540 tggtgtggtg gctggaggtg gccgaattga caaacccatc ttgaaggctg gccgggcgta 600 ccacaaatat aaggcaaaga ggaactgctg gccacgagta cggggtgtgg ccatgaatcc 660 tgtggagcat ccttttggag gtggcaacca ccagcacatc ggcaagccct ccaccatccg 720 cagagatgcc cctgctggcc gcaaagtggg tctcattgct gcccgccgga ctggacgtct 780 ccggggaacc aagactgtgc aggagaaaga gaactagtgc tgagggcctc aataaagttt 840 gtgtttatgc ca 852 309 814 DNA Homo sapiens 309 agaatcccgg acagccctgc tccctgcagc caggtgtagt ttcgggagcc actggggcca 60 aagtgagagt ccagcggtct tccagcgctt gggccacggc ggcggccctg ggagcagagg 120 tggagcgacc ccattacgct aaagatgaaa ggctggggtt ggctggccct gcttctgggg 180 gccctgctgg gaaccgcctg ggctcggagg agccaggatc tccactgtgg agcatgcagg 240 gctctggtgg atgaactaga atgggaaatt gcccaggtgg accccaagaa gaccattcag 300 atgggatctt tccggatcaa tccagatggc agccagtcag tggtggaggt gccttatgcc 360 cgctcagagg cccacctcac agagctgctg gaggagatat gtgaccggat gaaggagtat 420 ggggaacaga ttgatccttc cacccatcgc aagaactacg tacgtgtagt gggccggaat 480 ggagaatcca gtgaactgga cctacaaggc atccgaatcg actcagatat tagcggcacc 540 ctcaagtttg cgtgtgagag cattgtggag gaatacgagg atgaactcat tgaattcttt 600 tcccgagagg ctgacaatgt taaagacaaa ctttgcagta agcgaacaga tctttgtgac 660 catgccctgc acatatcgca tgatgagcta tgaaccactg gagcagccca cactggcttg 720 atggatcacc cccaggaggg gaaaatggtg gcaatgcctt ttatatatta tgtttttact 780 gaaattaact gaaaaaatat gaaaccaaaa gtac 814 310 1362 DNA Homo sapiens 310 atttgggga cgctctcagc tctcggcgca cggcccagct tccttcaaaa tgtctactgt 60 cacgaaatc ctgtgcaagc tcagcttgga gggtgatcac tctacacccc caagtgcata 120 gggtctgtc aaagcctata ctaactttga tgctgagcgg gatgctttga acattgaaac 180 gccatcaag accaaaggtg tggatgaggt caccattgtc aacattttga ccaaccgcag 240 aatgcacag agacaggata ttgccttcgc ctaccagaga aggaccaaaa aggaacttgc 300 tcagcactg aagtcagcct tatctggcca cctggagacg gtgattttgg gcctattgaa 360 acacctgct cagtatgacg cttctgagct aaaagcttcc atgaaggggc tgggaaccga 420 gaggactct ctcattgaga tcatctgctc cagaaccaac caggagctgc aggaaattaa 480 agagtctac aaggaaatgt acaagactga tctggagaag gacattattt cggacacatc 540 ggtgacttc cgcaagctga tggttgccct ggcaaagggt agaagagcag aggatggctc 600 gtcattgat tatgaactga ttgaccaaga tgctcgggat ctctatgacg ctggagtgaa 660 aggaaagga actgatgttc ccaagtggat cagcatcatg accgagcgga gcgtgcccca 720 ctccagaaa gtatttgata ggtacaagag ttacagccct tatgacatgt tggaaagcat 780 aggaaagag gttaaaggag acctggaaaa tgctttcctg aacctggttc agtgcattca 840 aacaagccc ctgtattttg ctgatcggct gtatgactcc atgaagggca aggggacgcg 900 gataaggtc ctgatcagaa tcatggtctc ccgcagtgaa gtggacatgt tgaaaattag 960 tctgaattc aagagaaagt acggcaagtc cctgtactat tatatccagc aagacactaa 1020 ggcgactac cagaaagcgc tgctgtacct gtgtggtgga gatgactgaa gcccgacacg 1080 cctgagcgt ccagaaatgg tgctcaccat gcttccagct aacaggtcta gaaaaccagc 1140 tgcgaataa cagtccccgt ggccatccct gtgagggtga cgttagcatt acccccaacc 1200 cattttagt tgcctaagca ttgcctggcc ttcctgtcta gtctctcctg taagccaaag 1260 aatgaacat tccaaggagt tggaagtgaa gtctatgatg tgaaacactt tgcctcctgt 1320 gtactgtgtc ataaacagat gaataaactg aatttgtact tt 1362 311 3035 DNA Homo sapiens 311 gcggggcggg ccggcggcgg aggccgggcc gcggagccag gagtgactag cagcagttgg 60 ccgtgccgta gcagcgtccc gcgcgcggcg ggcagcggcc caggaggcgc gtggtgcggg 120 tttcggcggc ggctgaggaa gaagcgcggg cggcgccttc gggaggcgag caggcagcag 180 ttggccgtgc cgtagcagcg tcccgcgcgc ggcgggcagc ggcccaggag gcgcgtggcg 240 gcgctcggcc tcgcggcggc ggcggcggca gcggcccagc agttggcggc gagcgcgtct 300 gcgcctgcgc ggcgggcccc gcgcccctcc tccccccctg ggcgcccccg gcggcgtgtg 360 aatggcggcc tccgcggcgg cagcctcggc agcagcggcc tcggccgcct ctggcagccc 420 gggcccgggc gagggctccg ctggcggcga aaagcgctcc accgcccctt cggccgcagc 480 ctcggcctct gcctcagccg cggcgtcgtc gcccgcgggg ggcggcgccg aggcgctgga 540 gctgctggag cactgcggcg tgtgcagaga gcgcctgcga cccgagaggg agccccgcct 600 gctgccctgt ttgcactcgg cctgtagtgc ctgcttaggg cccgcggccc ccgccgccgc 660 caacagctcg ggggacggcg gggcggcggg cgacggcacc gtggtggact gtcccgtgtg 720 caagcaacag tgcttctcca aagacatcgt ggagaattat ttcatgcgtg atagtggcag 780 caaggctgcc accgacgccc aggatgcgaa ccagtgctgc actagctgtg aggataatgc 840 cccagccacc agctactgtg tggagtgctc ggagcctctg tgtgagacct gtgtagaggc 900 gcaccagcgg gtgaagtaca ccaaggacca tactgtgcgc tctactgggc cagccaagtc 960 tcgggatggt gaacgtactg tctattgcaa cgtacacaag catgaacccc ttgtgctgtt 1020 ttgtgagagc tgtgatactc tcacctgccg agactgccag ctcaatgccc acaaggacca 1080 ccagtaccag ttcttagagg atgcagtgag gaaccagcgc aagctcctgg cctcactggt 1140 gaagcgcctt ggggacaaac atgcaacatt gcagaagagc accaaggagg ttcgcagctc 1200 aatccgccag gtgtctgacg tacagaagcg tgtgcaagtg gatgtcaaga tggccatcct 1260 gcagatcatg aaggagctga ataagcgggg ccgtgtgctg gtcaatgatg cccagaaggt 1320 gactgagggg cagcaggagc gcctggagcg gcagcactgg accatgacca agatccagaa 1380 gcaccaggag cacattctgc gctttgcctc ttgggctctg gagagtgaca acaacacagc 1440 ccttttgctt tctaagaagt tgatctactt ccagctgcac cgggccctca agatgattgt 1500 ggatcccgtg gagccacatg gcgagatgaa gtttcagtgg gacctcaatg cctggaccaa 1560 gagtgccgag gcctttggca agattgtggc agagcgtcct ggcactaact caacaggccc 1620 tgcacccatg gcccctccaa gagccccagg gcccctgagc aagcagggct ctggcagcag 1680 ccagcccatg gaggtgcagg aaggctatgg ctttgggtca ggagatgatc cctactcaag 1740 tgcagagccc catgtgtcag gtgtgaaacg gtcccgctca ggtgagggcg aggtgagcgg 1800 ccttatgcgc aaggtgccac gagtgagcct tgaacgcctg gacctggacc tcacagctga 1860 cagccagcca cccgtcttca aggtcttccc aggcagtacc actgaggact acaaccttat 1920 tgttattgaa cgtggcgctg ccgctgcagc taccggccag ccagggactg cgcctgcagg 1980 aacccctggt gccccacccc tggctggcat ggccattgtc aaggaggagg agacggaggc 2040 tgccattgga gcccctccta ctgccactga gggccctgag accaaacctg tgcttatggc 2100 tcttgcggag ggtcctggtg ctgagggtcc ccgcctggcc tcacctagtg gcagcaccag 2160 ctcagggctg gaggtggtgg ctcctgaggg tacctcagcc ccaggtggtg gcccgggaac 2220 cctggatgac agtgccacca tttgccgtgt ctgccagaag ccaggcgatc tggttatgtg 2280 caaccagtgt gagttttgtt tccacctgga ctgtcacctg ccggccctgc aggatgtacc 2340 aggggaggag tggagctgct cactctgcca tgtgctccct gacctgaagg aggaggatgg 2400 cagcctcagc ctggatggtg cagacagcac tggcgtggtg gccaagctct caccagccaa 2460 ccagcggaaa tgtgagcgtg tactgctggc cctattctgt cacgaaccct gccgccccct 2520 gcatcagctg gctaccgact ccaccttctc cctggaccag cccggtggca ccctggatct 2580 gaccctgatc cgtgcccgcc tccaggagaa gttgtcacct ccctacagct ccccacagga 2640 gtttgcccag gatgtgggcc gcatgttcaa gcaattcaac aagttaactg aggacaaggc 2700 agacgtgcag tccatcatcg gcctgcagcg cttcttcgag acgcgcatga acgaggcctt 2760 cggtgacacc aagttctctg ctgtgctggt ggagcccccg ccgatgagcc tgcctggtgc 2820 tggcctgagt tcccaggagc tgtctggtgg ccctggtgat ggcccctgag gctggagccc 2880 ccatggccag cccagcctgg ctctgttctc tgtcctgtca ccccatcccc actcccctgg 2940 tggcctgact cccactccct ggtggcccca tcccccagtt cctcacgata tggtttttac 3000 ttctgtggat ttaataaaaa aaacttcacc agttc 3035 312 924 DNA Homo sapiens 312 atgttctctt tcaacatgtt cgaccaccct attcccaggg tcttccaaaa ccgcttctcc 60 acacagtacc gctgcttctc tgtgtccatg ctagcagggc ctaatgacag gtcagatgtg 120 gagaaaggag ggaagataat tatgccaccc tcggccctgg accaactcag ccgacttaac 180 attacctatc ccatgctgtt caaactgacc aataagaatt cggaccgcat gacgcattgt 240 ggcgtgctgg agtttgtggc tgatgagggc atctgctacc tcccacactg gatgatgcag 300 aacttactct tggaagaagg cggcctggtc caggtggaga gcgtcaacct tcaagtggcc 360 acctactcca aattccaacc tcagagccct gacttcctgg acatcaccaa ccccaaagcc 420 gtattagaaa acgcacttag gaactttgcc tgtctgacca ccggggatgt gattgccatc 480 aactataatg aaaagatcta cgaactgcgt gtgatggaga ccaaacccga caaggcagtg 540 tccatcattg agtgtgacat gaacgtggac tttgatgctc ccctgggcta caaagaaccc 600 gaaagacaag tccagcatga ggagtcgaca gaaggtgaag ccgaccacag tggctatgct 660 ggagagctgg gcttccgcgc tttctctgga tctggcaata gactggatgg aaagaagaaa 720 ggggtagagc ccagcccctc cccaatcaag cctggagata ttaaaagagg aattcccaat 780 tatgaattta aacttggtaa gataactttc atcagaaatt cacgtcccct tgtcaaaaag 840 gttgaagagg atgaagctgg aggcagattc gtcgctttct ctggagaagg acagtcattg 900 cgtaaaaagg gaagaaagcc ctaa 924 313 1845 DNA Homo sapiens 313 gggacgtgag ccgctgcgcc caccgggcta gacccggcgc catcatgctg cttctgccaa 60 ygcgccgcgga cggccggggc accgccatca cccacgctct gacctctgcc tctacactct 120 gtcaagttga acctgtggga agatggtttg aagcttttgt taagaggaga aacagaaatg 180 cttctgcctc ttttcaggaa ctggaggata agaaagagtt atccgaggaa tcagaagatg 240 aagaattgca gttggaagag tttcccatgc tgaaaacact tgatcccaaa gactggaaga 300 accaagatca ttatgcagtt cttggacttg gccatgtgag atacaaggct acacagagac 360 agatcaaagc agctcataaa gcaatggttt taaaacatca cccagacaaa cggaaagcag 420 ctggtgaacc aataaaagaa ggagataatg actacttcac ttgcataact aaagcttatg 480 aaatgttatc tgatccagtg aaaagacgag catttaacag tgtagatcct acttttgata 540 actcagttcc ttctaaaagt gaagcaaagg ataatttctt cgaagtgttt accccagtgt 600 ttgaaaggaa ttccagatgg tcaaataaaa aaaatgttcc taaacttggt gatatgaatt 660 catcatttga agatgtagat atattttatt ctttctggta taattttgat tcttggagag 720 aattttctta tttagatgaa gaagaaaaag aaaaagcaga atgtcgtgat gagaggagat 780 ggattgaaaa gcagaacgga gcaacaagag cacaaagaaa aaaagaagaa atgaacagaa 840 taagaacatt agttgacaat gcatacagct gtgatccaag gataaaaaag ttcaaggaag 900 aagaaaaagc caagaaagaa gcagaaaaga aagcaaaagc agaagctaaa cggaaggagc 960 aagaagctaa agaaaaacaa agacaagctg aattagaagc tgctcggtta gctaaggaga 1020 aagaagagga ggaagtcaga cagcaagcat tgctggcaaa gaaggaaaaa gatatccaga 1080 aaaaagccat taagaaggaa aggcaaaaac ttcgaaactc atgcaagata gaagaaataa 1140 atgagcaaat cagaaaagag aaagaggaag ctgaggctcg tatgcgacaa gcatctaaga 1200 acacagagaa atcaactggt ggaggtggaa atggaagtaa aaattggtca gaagatgatc 1260 tacaattact aattaaagct gtgaatctgt tccctgctag aacaaattca agatgggaag 1320 ttattgctaa ttacatgaac atacattctt cctctggagt caaaagaact gccaaagatg 1380 ttattggcaa agcaaagagt ctccaaaaac ttgaccctca tcaaaaagat gacataaata 1440 aaaaggcatt tgataagttc aaaaaagaac atggagtggt acctcaagca gacaacgcaa 1500 cgccttcaga acgatttgaa ggtccatata cagacttcac cccttggaca acagaagaac 1560 agaagctttt ggaacaagct ttgaaaacat acccagtaaa tacacctgaa agatgggaaa 1620 aaatagcaga agcggtgcct ggcaggacaa agaaggactg catgaaacga tacaaggaac 1680 ttgtcgagat ggtaaaagca aagaaagctg ctcaagaaca agtgctgaat gcaagtagag 1740 ccaagaaatg acaatctttg ttgtgtgtgc atttttataa taaaactgaa aatactgtaa 1800 acattttcat tcttaaaatt atactcatgg taataatttg aaagt 1845 314 2623 DNA Homo sapiens 314 atagtggagg aagcagtgca ggagctgaac tctttcctcg cacaggagaa tatgaggcta 60 caggaattga cagatcttct tcaggaaaag catcgcacca tgtctcagga gttctccaag 120 ttgcagagta aagtggagac agccgaatca cgagtgtctg tcctggagtc catgattgat 180 gacctgcagt gggatattga caaaattcga aagagggaac agcgactcaa ccgacactta 240 gcagaagtcc tagaacgggt gaattccaaa ggttataagg tgtatggagc ggggagcagt 300 ctgtatggcg gcacaatcac tatcaatgct cggaagtttg aggaaatgaa tgcagagctt 360 gaggagaaca aagagttggc tcagaaccgt ctctgtgagc tggagaaact tcggcaagac 420 tttgaggagg tcactacaca aaatgaaaag ctgaaggtgg aattgcggag tgcagtggag 480 caagtcgtta aggaaactcc agaatatcgc tgcatgcagt cacagttctc cgtcttgtat 540 aatgagagcc tacagttgaa agcacacttg gatgaggctc ggaccctgct tcatggcacc 600 agaggaaccc accagcacca ggttgagctt attgagcgag atgaggttag tcttcataag 660 aagctgagga ctgaagtaat tcagctagaa gatacattgg cccaggtccg caaggagtat 720 gaaatgctga ggatagaatt tgagcagacc cttgctgcca atgaacaagc aggccctata 780 aacagggaga tgcgccacct catcagtagc ctccagaatc acaatcacca gctgaaaggg 840 gaggtcctga gatataagcg gaaattgaga gaagcccagt ctgacctgaa caagacacgc 900 ctgcgtagtg gtagtgccct cctgcagtcc cagtctagta ctgaggaccc gaaggatgag 960 cctgcggagc taaaaccaga ttctgaggac ttatcctccc agtcctcagc ttcaaaggca 1020 tctcaggagg atgccaatga aatcaagtct aaacgggatg aagaagaacg agaacgagaa 1080 aggagggaga aggagaggga acgagaaaga gaacgggaga aggagaagga gagagaacga 1140 gagaagcaga agctaaaaga gtcagaaaaa gagagagatt ctgctaagga taaagagaaa 1200 ggcaaacatg atgatggacg gaaaaaggaa gcagaaatta tcaaacaatt gaagattgaa 1260 ctcaagaagg cacaggagag ccaaaaggag atgaaactat tgctggatat gtaccgttct 1320 gccccaaagg aacagagaga caaagttcag ctgatggcag ctgagaagaa gtctaaggca 1380 gagttggaag atctaaggca aagactcaag gatctggaag ataaagagaa gaaagagaac 1440 aagaaaatgg ctgatgagga tgccttgagg aagatccggg cagtggagga gcagatagaa 1500 tacctacaga agaagctagc catggccaag caggaagaag aagcactcct ctctgaaatg 1560 gatgtcacag gccaggcctt tgaagacatg caggagcaaa atatccgttt gatgcagcaa 1620 ttgcgggaga aggatgatgc aaatttcaag ctcatgtcag agcgtatcaa gtccaatcag 1680 atccataagt tgcttaaaga agagaaggag gagctggcag accaggtgtt gactctgaag 1740 actcaggttg atgcccagct acaggtagta aggaaactgg aagagaagga gcatctgtta 1800 cagagcaaca ttggcacagg ggagaaagag ctgggtctta ggacccaagc cttagagatg 1860 aataaacgca aggcaatgga ggcagcccag cttgcagatg acctcaaagc acaactggag 1920 ttggctcaga agaagctaca tgattttcag gatgagatcg tggagaacag tgttaccaaa 1980 gaaaaggaca tgttcaattt caaacgagcc caggaggaca tctctagact tcgcaggaag 2040 ctggagacca caaagaaacc agacaatgta cccaagtgtg atgagattct gatggaagag 2100 attaaggatt acaaggcacg cttgacctgt ccgtgctgta acatgcgtaa aaaggatgct 2160 gttcttacta agtgttttca tgtcttctgc tttgagtgtg tgaagacacg ctatgacacc 2220 cgccagcgca aatgtcccaa gtgtaatgct gcttttggtg ccaatgattt tcatcgcatc 2280 tacattggtt gatctaagtc aagagaagaa gaggagctgg ctagtcagga acttattcat 2340 taaccaccaa acctctacct cttctctcct tgactgtcac ctgtaggaca gtttatcagt 2400 caactacctt tcctccagac tttacttcca ggctctcctc ttcagtagct ggatgacttt 2460 agcagaaagg actggtaaat acaagccttg ggtttcagaa tgaattagaa acaaataact 2520 cttactgtct tccctcccag ctttgtttat tttgtgcttt tagacttttc agtgttttct 2580 ttttccagcc cactgtataa acttggattg tccattcctc ctg 2623 315 1505 DNA Homo sapiens 315 agctgcggcg ctgggctggc ggcggcgagt ccacgtgctc cccgcggccg gttgaaaccg 60 ttggcgggcg ctggctgaga ggcaatgttt gctgtcttcc attggagtga ctgaatttct 120 acatgacggc tttttgacaa gacttaaaac ctgtcttgga tagagaatat ttagccattt 180 acctaaaaat ggtatttttt acatgcaatg catgtggtga atcagtgaag aaaatacaag 240 tggaaaagca tgtgtctgtt tgcagaaact gtgaatgcct ttcttgcatt gactgcggta 300 aagatttctg gggcgatgac tataaaaacc acgtgaaatg cataagtgaa gatcagaagt 360 atggtggcaa aggctatgaa ggtaaaaccc acaaaggcga catcaaacag caggcgtgga 420 ttcagaaaat tagtgaatta ataaagagac ccaatgtcag ccccaaagtg agagaacttt 480 tagagcaaat tagtgctttt gacaacgttc ccaggaaaaa ggcaaaattt cagaattgga 540 tgaagaacag tttaaaagtt cataatgaat ccattctgga ccaggtgtgg aatatctttt 600 ctgaagcttc caacagcgaa ccagtcaata aggaacagga tcaacggcca ctccacccag 660 tggcaaatcc acatgcagaa atctccacca aggttccagc ctccaaagtg aaagacgccg 720 tggaacagca aggggaggtg aagaagaata aaagagaaag aaaggaagaa cggcagaaga 780 aaaggaaaag agaaaagaaa gaactaaagt tagaaaacca ccaggaaaac tcaaggaatc 840 agaagcctaa gaagcgcaaa aagggacagg aggctgacct tgaggctggt ggggaggaag 900 tccctgaggc caatggctct gcagggaaga ggagcaagaa gaagaagcag cgcaaggaca 960 gcgccagtga ggaagaggca cgcgtgggcg cagggaagag gaagcggagg cactcggaag 1020 ttgaaacaga ttctaagaag aaaaagatga agctcccaga gcatcctgag ggcggagaac 1080 cagaagacga tgaggctcct gcaaaaggta aattcaactg gaagggaact attaaagcaa 1140 ttctgaaaca ggccccagac aatgaaataa ccatcaaaaa gctaaggaaa aaggttttag 1200 ctcagtacta cacagtgaca gatgagcatc acagatccga agaggaactc ctggtcatct 1260 ttaacaagaa aatcagcaag aaccctacct ttaagttatt aaaggacaaa gtcaagcttg 1320 tgaaatgaac atttgtgtat ttaaaaattg aatccattct gctgacttct tcctttcact 1380 gctgtttata aaatgtgtaa tgaattctaa caactcaaat tttgcttttt gaagctgtat 1440 ttttaagtta agaaaatata tttttggtat aacttttatg agaaaaataa aatatattct 1500 ggtcc 1505 316 1732 DNA Homo sapiens 316 actctcggga tggagggcga gcgccgggca tcgcaggcgc cctcctcggg cctcccggcc 60 gggggcgcca acggggagag cccggggggc ggcgccccct ttccgggcag cagcggctct 120 tccgccctgc tgcaggcgga ggtgctggat ctggacgagg acgaggacga cctggaggtg 180 ttcagcaagg atgcctcatt gatggacatg aactccttca gccctatgat gccaacatcc 240 cctttatcaa tgataaacca aatcaagttt gaggatgaac cagatttaaa ggatctcttc 300 atcacagttg atgaacctga aagtcatgtt actacaatag aaactttcat tacgtatagg 360 attattacta agacatctcg tggggaattt gactccagtg aatttgaagt taggagacga 420 tatcaagatt tcctttggtt gaagggaaaa ctggaagaag cacaccccac tctgattatt 480 ccaccattgc cagaaaagtt tatagtaaaa ggaatggtgg aacgctttaa cgatgacttc 540 attgagacac gcaggaaggc tttacataaa tttttgaacc gaattgctga tcatccaact 600 ttaacattta atgaagactt caaaattttt ctcactgcac aagcttggga actctcttct 660 cacaagaagc aaggtcctgg cttgctaagc aggatggggc aaaccgtcag agctgttgcg 720 tcctcaatga gaggagttaa aaaccgccca gaggagttca tggaaatgaa taactttatt 780 gaactattta gccagaaaat aaatttgata gataaaatat ctcagagaat ttataaggaa 840 gaaagggaat attttgatga aatgaaagaa tatggcccaa ttcatattct gtggtcagcg 900 tcagaagagg atctggttga tactctaaag gatgttgcca gctgcattga cagatgctgt 960 aaggccactg aaaagcggat gtctggactc tcagaggccc tgcttcctgt tgtacatgag 1020 tacgtgcttt atagtgaaat gttaatgggt gttatgaaaa gaagagacca aatacaagca 1080 gaactggatt ccaaagttga agttttgacc tataaaaagg cagatactga tctgcttcca 1140 gaggagattg gaaaacttga agataaagtg gaatgtgcta ataatgccct gaaagcagat 1200 tgggagagat ggaaacaaaa tatgcaaaat gatatcaagt tagcatttac agatatggct 1260 gaggagaata tccattatta tgaacagtgc cttgctacgt gggaatcatt ccttacatca 1320 cagaccaacc ttcacttgga agaagcctct gaagataaac cttaatccca ttgaggactt 1380 ctgtttgatc tttgggagac agcatttatt aaccaaagtt attctttctg gatctgccgt 1440 gtccttataa agtggatgaa aaatgttttg tacccatctg gaaaaccaac aacttgaaat 1500 ctcaggtatt ccaggtcact gacatgaatt tgaagatata tctatctgta tggatatata 1560 tctatatgta tatagatata taaatacaga gagatatctg gcttggtttt aattatgttc 1620 ttaaatttgt gtgccaataa ttgcatatag attttttttc ttaaatattt gactgtggaa 1680 catgccattt taaatatgtt gtaaggactg ttttaataaa aagtttagta tg 1732 317 1912 DNA Homo sapiens misc_feature (1)...(1912) n = A,T,C or G 317 gccgcgggcg gcggcggcag cggttggagg ttgtaggacc ggcgaggaat aggaatcatg 60 gcggctgcgc tgttcgtgct gctgggattc gcgctgctgg gcacccacgg agcctccggg 120 gctgccggca cagtcttcac taccgtagaa gaccttggct ccaagatact cctcacctgc 180 tccttgaatg acagcgccac agaggtcaca gggcaccgct ggctgaaggg gggcgtggtg 240 ctgaaggagg acgcgctgcc cggccagaaa acggagttca aggtggactc cgacgaccag 300 tggggagagt actcctgcgt cttcctcccc gagcccatgg gcacggccaa catccagctc 360 cacgggcctc ccagagtgaa ggctgtgaag tcgtcagaac acatcaacga gggggagacg 420 gccatgctgg tctgcaagtc agagtccgtg ccacctgtca ctgactgggc ctggtacaag 480 atcactgact ctgaggacaa ggccctcatg aacggctccg agagcaggtt cttcgtgagt 540 tcctcgcagg gccggtcaga gctacacatt gagaacctga acatggaggc cgaccccggc 600 cagtaccggt gcaacggcac cagctccaag ggctccgacc aggccatcat cacgctccgc 660 gtgcgcagcc acctggccgc cctctggccc ttcctgggca tcgtggctga ggtgctggtg 720 ctggtcacca tcatcttcat ctacgagaag cgccggaagc ccgaggacgt cctggatggt 780 gagccgtctg cccnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 nnnnnnnnnn nntgccccag gaagagcagc gggcagcacc agaatgacaa aggcaagaac 1140 gtccgccaga ggaactcttc ctgaggcagg tggcccgagg acgctccctg ctccncgtct 1200 gcgccgccgc cggagtccac tcccagtgct tgcaagattc caagttctca cctcttaaag 1260 aaaacccacc ccgtagattc ccatcataca cttccttctt ttttaaaaaa gttgggtttt 1320 ctccattcag gattctgttc cttaggnttt tttccttctg aagtgtttca cgagagcccg 1380 ggagctgctg ccctgcggcc ccgtctgtgg ctttcagcct ctgggtctga gtcatggccg 1440 ggtgggcggc acagccttct ccactggccg gagtcagtgc caggtccttg ccctttgtgg 1500 aaagtcacag gtcacacgag gggccccgtg tcctgcctgt ctgaagccaa tgctgtctgg 1560 ttgcgccatt tttgtgcttt tatgtttaat tttatgaggg ccacgggtct gtgttcgact 1620 cagcctcagg gacgactctg acctcttggc cacagaggac tcacttgccc acaccgaggg 1680 cgaccccgtc acagcctcaa gtcactccca agccccctcc ttgtctgtgc atccgggggc 1740 agctctggag ggggtttgct ggggaactgg cgccatcgcc gggactccag aaccgcagaa 1800 gcctccccag ctcacccctg gaggacggcc ggctctctat agcaccaggg ctcacgtggg 1860 aacccccctc ccacccaccg ccacaataaa gatcgccccc acctccaccc tc 1912 318 2780 DNA Homo sapiens 318 ggcggcggta gcagccaggc ttggcccccg gcgtggagca gacgcggacc cctccttcct 60 ggcggcggcg gcgcgggctc agagcccggc aacgggcggg cgggcagaat gagtctgcag 120 gtcttaaacg acaaaaatgt cagcaatgaa aaaaatacag aaaattgcga cttcctgttt 180 tcgccaccag aagttaccgg aagatcgtct gttcttcgtg tgtcacagaa agaaaatgtg 240 ccacccaaga acctggccaa agctatgaag gtgacttttc agacacctct gcgggatcca 300 cagacgcaca ggattctaag tcctagcatg gccagcaaac ttgaggctcc tttcactcag 360 gatgacaccc ttggactgga aaactcacac ccggtctgga cacagaaaga gaaccaacag 420 ctcatcaagg aagtggatgc caaaactact catggaattc tacagaaacc agtggaggct 480 gacaccgacc tcctggggga tgcaagccca gcctttggga gtggcagctc cagcgagtct 540 ggcccaggtg ccctggctga cctggactgc tcaagctctt cccagagccc aggaagttct 600 gagaaccaaa tggtgtctcc aggaaaagtg tctggcagcc ctgagcaagc cgtggaggaa 660 aaccttagtt cctattcctt agacagaaga gtgacacccg cctctgagac cctagaagac 720 ccttgcagga cagagtccca gcacaaagcg gagactccgc acggagccga ggaagaatgc 780 aaagcggaga ctccgcacgg agccgaggag gaatgccggc acggtggggt ctgtgctccc 840 gcagcagtgg ccacttcgcc tcctggtgca atccctaagg aagcctgcgg aggagcaccc 900 ctgcagggtc tgcctggcga agccctgggc tgccctgcgg gtgtgggcac ccccgtgcca 960 gcagatggca ctcagaccct tacctgtgca cacacctctg ctcctgagag cacagcccca 1020 accaaccacc tggtggctgg cagggccatg accctgagtc ctcaggaaga agtggctgca 1080 ggccaaatgg ccagctcctc gaggagcgga cctgtaaaac tagaatttga tgtatctgat 1140 ggcgccacca gcaaaagggc acccccacca aggagactgg gagagaggtc cggcctcaag 1200 cctcccttga ggaaagcagc agtgaggcag caaaaggccc cgcaggaggt ggaggaggac 1260 gacggtagga gcggagcagg agaggacccc cccatgccag cttctcgggg ctcttaccac 1320 ctcgactggg acaaaatgga tgacccaaac ttcatcccgt tcggaggtga caccaagtct 1380 ggttgcagtg aggcccagcc cccagaaagc cctgagacca ggctgggcca gccagcggct 1440 gaacagttgc atgctgggcc tgccacggag gagccaggtc cctgtctgag ccagcagctg 1500 cattcagcct cagcggagga cacgcctgtg gtgcagttgg cagccgagac cccaacagca 1560 gagagcaagg agagagcctt gaactctgcc agcacctcgc ttcccacaag ctgtccaggc 1620 agtgagccag tgcccaccca tcagcagggg cagcctgcct tggagctgaa agaggagagc 1680 ttcagagacc ccgctgaggt tctaggcacg ggcgcggagg tggattacct ggagcagttt 1740 ggaacttcct cgtttaagga gtcggccttg aggaagcagt ccttatacct caagttcgac 1800 cccctcctga gggacagtcc tggtagacca gtgcccgtgg ccaccgagac cagcagcatg 1860 cacggtgcaa atgagactcc ctcaggacgt ccgcgggaag ccaagcttgt ggagttcgat 1920 ttcttgggag cactggacat tcctgtgcca ggcccacccc caggtgttcc cgcgcctggg 1980 ggcccacccc tgtccaccgg acctatagtg gacctgctcc agtacagcca gaaggacctg 2040 gatgcagtgg taaaggcgac acaggaggag aaccgggagc tgaggagcag gtgtgaggag 2100 ctccacggga agaacctgga actggggaag atcatggaca ggttcgaaga ggttgtgtac 2160 caggccatgg aggaagttca gaagcagaag gaactttcca aagctgaaat ccagaaagtt 2220 ctaaaagaaa aagaccaact taccacagat ctgaactcca tggagaagtc cttctccgac 2280 ctcttcaagc gttttgagaa acagaaagag gtgatcgagg gctaccgcaa gaacgaagag 2340 tcactgaaga agtgcgtgga ggattacctg gcaaggatca cccaggaggg ccagaggtac 2400 caagccctga aggcccacgc ggaggagaag ctgcagctgg caaacgagga gatcgcccag 2460 gtccggagca aggcccaggc ggaagcgttg gccctccagg ccagcctgag gaaggagcag 2520 atgcgcatcc agtcgctgga gaagacagtg gagcagaaga ctaaagagaa cgaggagctg 2580 accaggatct gcgacgacct catctccaag atggagaaga tctgacctcc acggagccgc 2640 tgtccccgcc cccctgctcc cgtctgtctg tcctgtctga ttctcttagg tgtcatgttc 2700 ttttttctgt cttgtcttca acttttttta aaactagatt gctttgaaaa catgactcaa 2760 taaaagtttc ctttcaattt 2780 319 1675 DNA Homo sapiens 319 ccagccgtcc attccggtgg aggcagaggc agtcctgggg ctctggggct cgggctttgt 60 caccgggacc cgcaggagcc agaaccactc ggcgccgcct ggtgcatggg aggggagccg 120 ggccaggagt aagtaactca tacgggcgcc ggggacccgg gtcgggctgg gggcttccaa 180 ctcagaggga gtgtgatttg cctgatcctc ttcggcgttg tcctgctctg ccgcatccag 240 ccctgtaccg ccatcccact tcccgccgtt cccatctgtg ttccgggtgg gatcggtctg 300 gaggcggccg aggacttccc aggcaggagc tcggggcgga ggccgggtcc gcggcagacc 360 agggcagcga ggcgctggcc ggcagggggc gctgcggtgc cagcctgagg ctgggctgct 420 ccgcgaggat acagcggccc ctgccctgtc ctgtcctgcc ctgccctgtc ctgtcctgcc 480 ctgccctgcc ctgtcctgtc ctgccctgcc ctgccctgtg tcctcagaca atatgttagc 540 cgtgcacttt gacaagccgg gaggaccgga aaacctctac gtgaaggagg tggccaagcc 600 gagcccgggg gagggtgaag tcctcctgaa ggtggcggcc agcgccctga accgggcgga 660 cttaatgcag agacaaggcc agtatgaccc acctccagga gccagcaaca ttttgggact 720 tgaggcatct ggacatgtgg cagagctggg gcctggctgc cagggacact ggaagatcgg 780 ggacacagcc atggctctgc tccccggtgg gggccaggct cagtacgtca ctgtccccga 840 agggctcctc atgcctatcc cagagggatt gaccctgacc caggctgcag ccatcccaga 900 ggcctggctc accgccttcc agctgttaca tcttgtggga aatgttcagg ctggagacta 960 tgtgctaatc catgcaggac tgagtggtgt gggcacagct gctatccaac tcacccggat 1020 ggctggagct attcctctgg tcacagctgg ctcccagaag aagcttcaaa tggcagaaaa 1080 gcttggagca gctgctggat tcaattacaa aaaagaggat ttctctgaag caacgctgaa 1140 attcaccaaa ggtgctggag ttaatcttat tctagactgc ataggcggat cctactggga 1200 gaagaacgtc aactgcctgg ctcttgatgg tcgatgggtt ctctatggtc tgatgggagg 1260 aggtgacatc aatgggcccc tgttttcaaa gctacttttt aagcgaggaa gtctgatcac 1320 cagtttgctg aggtctaggg acaataagta caagcaaatg ctggtgaatg ctttcacgga 1380 gcaaattctg cctcacttct ccacggaggg cccccaacgt ctgctgccgg ttctggacag 1440 aatctaccca gtgaccgaaa tccaggaggc ccataagtac atggaggcca acaagaacat 1500 aggcaagatc gtcctggaac tgccccagtg aaggaggatg gggcaggaca ggacgcggcc 1560 accccaggcc tttccagagc aaacctggag aagattcaca atagacaggc caagaaaccc 1620 ggtgcttcct ccagagccgt ttaaagctga tatgaggaaa taaagagtga actgg 1675 320 1485 DNA Homo sapiens 320 gggattaagg cgtgagtgac tgcgcccggc catttttttg tatttttagt aaagatgggg 60 tttcaccatg ttggccaggc tggttgcaaa ctcctggcct caagtgaact gcctaccttg 120 gcctcccaaa gtgctgggat tgcaggtgtg agccaccaca cctggcctcg tttcatcttc 180 atatcagcac tgatgtagat ggtcttatcc tcattttata aatgaagaaa tgaaggctta 240 catagtatga ttggtgaaag gcacaaggct gaatggcacc accaaatgtg ctattgtgcc 300 tgtgtccctg gctctaggaa tactcctaga cctgcctcaa agctgccgtc tagttcttga 360 ggcactttgt gatcagtagg ttaatccttc tgacttaacc caaaagcttt ggggctccac 420 tttcttttcc tctctgaaga ctgacacagc tcccagataa cccctcccca ctttgaggac 480 ttgatgctgt agatggtttt aatgaagcca tggccttctc tggcaatact gtaggccatc 540 ggtgctcgga acttgctcaa atctatagca aagcagagag aagctcaaca gcagcaactt 600 caagccctaa tagcagaaaa gaaaatgcag ctagaaaggt aagaagtcat gatgtaaagc 660 aagttcttta cacagctgaa tttccactct tccttctgga atgatcactt cttgtcctta 720 ttcttcatca ttgttttccc aggaatagca tctcacttgt catcctgatt tgaagatttt 780 aaaaaataaa attacatttg atgggaagga aatagaaaac ttgttaatat caaaccacct 840 ctagtagttt gagcatgaaa gggtccacga agttggctgg ctctctgaaa gaagactttc 900 ctagggctgg agatttctca tgggtggctc tgggtagggc tttgcctctt tctttcttca 960 ttctgaactc acaccatttc ttgaaatcat ttaaacctct atatccttct ctggaagaat 1020 taggtctgtg gtttacttca tctttttttc agggcctcta acaattttgt ttttcaggta 1080 tcgggttgaa tatgaagctt tgtgtaaagt agaagcagaa caaaatgaat ttattgacca 1140 atttattttt cagaaatgaa ctgaaaattt cgcttttata gtaggaaggc aaaacaaaaa 1200 aaagcctctc aaaaccaaaa aaacctctgt agcattccag cggcttgacc aatgacctat 1260 gtcacaagag gtggcgtgta aggaatgcag ccccctgaag acagcactac aagtctgggg 1320 gagccagttt taacatcagt gcacagctgc tgctggtggc cctgcagtgt acgttctcac 1380 ctcttatgct tagttggaac taagcagttt gtaaactttc atcctttttt ttgtaaattc 1440 acaaagcttt ggaaggagaa gcaataaatt tttgttttca aatgg 1485 321 1399 DNA Homo sapiens 321 agtgtgaaat cttcagagaa gaatttctct ttagttcttt gcaagaaggt agagataaag 60 acactttttc aaaaatggca atggtatcag aattcctcaa gcaggcctgg tttattgaaa 120 atgaagagca ggaatatgtt caaactgtga agtcatccaa aggtggtccc ggatcagcgg 180 tgagccccta tcctaccttc aatccatcct cggatgtcgc tgccttgcat aaggccataa 240 tggttaaagg tgtggatgaa gcaaccatca ttgacattct aactaagcga aacaatgcac 300 agcgtcaaca gatcaaagca gcatatctcc aggaaacagg aaagcccctg gatgaaacac 360 tgaagaaagc ccttacaggt caccttgagg aggttgtttt agctctgcta aaaactccag 420 cgcaatttga tgctgatgaa cttcgtgctg ccatgaaggg ccttggaact gatgaagata 480 ctctaattga gattttggca tcaagaacta acaaagaaat cagagacatt aacagggtct 540 acagagagga actgaagaga gatctggcca aagacataac ctcagacaca tctggagatt 600 ttcggaacgc tttgctttct cttgctaagg gtgaccgatc tgaggacttt ggtgtgaatg 660 aagacttggc tgattcagat gccagggcct tgtatgaagc aggagaaagg agaaagggga 720 cagacgtaaa cgtgttcaat accatcctta ccaccagaag ctatccacaa cttcgcagag 780 tgtttcagaa atacaccaag tacagtaagc atgacatgaa caaagttctg gacctggagt 840 tgaaaggtga cattgagaaa tgcctcacag ctatcgtgaa gtgcgccaca agcaaaccag 900 ctttctttgc agagaagctt catcaagcca tgaaaggtgt tggaactcgc cataaggcat 960 tgatcaggat tatggtttcc cgttctgaaa ttgacatgaa tgatatcaaa gcattctatc 1020 agaagatgta tggtatctcc ctttgccaag ccatcctgga tgaaaccaaa ggagattatg 1080 agaaaatcct ggtggctctt tgtggaggaa actaaacatt cccttgatgg tctcaagcta 1140 tgatcagaag actttaatta tatattttca tcctataagc ttaaatagga aagtttcttc 1200 aacaggatta cagtgtagct acctacatgc tgaaaaatat agcctttaaa tcatttttat 1260 attataactc tgtataatag agataagtcc attttttaaa aatgttttcc ccaaaccata 1320 aaaccctata caagttgttc tagtaacaat acatgagaaa gatgtctatg tagctgaaaa 1380 taaaatgacg tcacaagac 1399 322 2739 DNA Homo sapiens 322 cgcacgcgca gtcgtatccg tgtgatgggc gggctgttga cggcgctgcg atggctgcct 60 gcgagggcag gagaagcgga gctctcggtt cctctcagtc ggacttcctg acgccgccag 120 tgggcggggc cccttgggcc gtcgccacca ctgtagtcat gtacccaccg ccgccgccgc 180 cgcctcatcg ggacttcatc tcggtgacgc tgagctttgg cgagagctat gacaacagca 240 agagttggcg gcggcgctcg tgctggagga aatggaagca actgtcgaga ttgcagcgga 300 atatgattct cttcctcctt gcctttctgc ttttctgtgg actcctcttc tacatcaact 360 tggctgacca ttggaaagct ctggctttca ggctagagga agagcagaag atgaggccag 420 aaattgctgg gttaaaacca gcaaatccac ccgtcttacc agctcctcag aaggcggaca 480 ccgaccctga gaacttacct gagatttcgt cacagaagac acaaagacac atccagcggg 540 gaccacctca cctgcagatt agacccccaa gccaagacct gaaggatggg acccaggagg 600 aggccacaaa aaggcaagaa gcccctgtgg atccccgccc ggaaggagat ccgcagagga 660 cagtcatcag ctggagggga gcggtgatcg agcctgagca gggcaccgag ctcccttcaa 720 gaagagcaga agtgcccacc aagcctcccc tgccaccggc caggacacag ggcacaccag 780 tgcatctgaa ctatcgccag aagggcgtga ttgacgtctt cctgcatgca tggaaaggat 840 accgcaagtt tgcatggggc catgacgagc tgaagcctgt gtccaggtcc ttcagtgagt 900 ggtttggcct cggtctcaca ctgatcgacg cgctggacac catgtggatc ttgggtctga 960 ggaaagaatt tgaggaagcc aggaagtggg tgtcgaagaa gttacacttt gaaaaggacg 1020 tggacgtcaa cctgtttgag agcacgatcc gcatcctggg ggggctcctg agtgcctacc 1080 acctgtctgg ggacagcctc ttcctgagga aagctgagga ttttggaaat cggctaatgc 1140 ctgccttcag aacaccatcc aagattcctt actcggatgt gaacatcggt actggagttg 1200 cccacccgcc acggtggacc tccgacagca ctgtggccga ggtgaccagc attcagctgg 1260 agttccggga gctctcccgt ctcacagggg ataagaagtt tcaggaggca gtggagaagg 1320 tgacacagca catccacggc ctgtctggga agaaggatgg gctggtgccc atgttcatca 1380 atacccacag tggcctcttc acccacctgg gcgtattcac gctgggcgcc agggccgaca 1440 gctactatga gtacctgctg aagcagtgga tccagggcgg gaagcaggag acacagctgc 1500 tggaagacta cgtggaagcc atcgagggtg tcagaacgca cctgctgcgg cactccgagc 1560 ccagtaagct cacctttgtg ggggagcttg cccacggccg cttcagtgcc aagatggacc 1620 acctggtgtg cttcctgcca gggacgctgg ctctgggcgt ctaccacggc ctgcccgcca 1680 gccacatgga gctggcccag gagctcatgg agacttgtta ccagatgaac cggcagatgg 1740 agacggggct gagtcccgag atcgtgcact tcaaccttta cccccagccg ggccgtcggg 1800 acgtggaggt caagccagca gacaggcaca acctgctgcg gccagagacc gtggagagcc 1860 tgttctacct gtaccgcgtc acaggggacc gcaaatacca ggactggggc tgggagattc 1920 tgcagagctt cagccgattc acacgggtcc cctcgggtgg ctattcttcc atcaacaatg 1980 tccaggatcc tcagaagccc gagcctaggg acaagatgga gagcttcttc ctgggggaga 2040 cgctcaagta tctgttcttg ctcttctccg atgacccaaa cctgctcagc ctggacgcct 2100 acgtgttcaa caccgaagcc caccctctgc ctatctggac ccctgcctag ggtggatggc 2160 tgctggtgtg gggacttcgg gtgggcagag gcaccttgct gggtctgtgg cattttccaa 2220 ggcccacgta gcaccggcaa ccgccaagtg gcccaggctc tgaactggct ctgggctcct 2280 cctcgtctct gctttaatca ggacaccgtg aggacaagtg aggccgtcag tcttggtgtg 2340 atgcggggtg ggctgggccg ctggagcctc cgcctgcttc ctccagaaga cacgaatcat 2400 gactcacgat tgctgaagcc tgagcaggtc tctgtgggcc gaccagaggg gggcttcgag 2460 gtggtccctg gtactggggt gaccgagtgg acagcccagg gtgcagctct gcccgggctc 2520 gtgaagcctc agrtgtcccc aatccaaggg tctggagggg ctgccgtgac tccagaggcc 2580 tgaggctcca gggctggctc tggtgtttac aagctggact cagggatcct cctggccgcc 2640 ccgcaggggg cttggagggc tggacggcaa gtccgtctag ctcacgggcc cctccagtgg 2700 aatgggtctt ttcggtggag ataaaagttg atttgctct 2739 323 4372 DNA Homo sapiens 323 cacgactcca cacgcgcgca cgcagccagc gagcggccgg agcggacggc agacggggcg 60 ggcggcgtca gggtcgcagc gtctacagct gctcgggggc ggtttcttgg cggaggcttg 120 gccggctcct ctctcccggc tccgcggcgg ctgcgaaggc ggcggctcct gccctctcgc 180 tttccctctc gcgtctctgg ctgcaggtga aaggaaagca agccaggatg gatatttacg 240 acactcaaac cttgggggtt gtggtctttg gaggattcat ggttgtttct gccattggca 300 tcttcctggt gtcgactttc tccatgaagg aaacgtcata tgaagaagcc ctagccaacc 360 agcgcaagga gatggcgaaa actcaccacc agaaagtcga gaagaaaaag aaggagaaaa 420 cagtggagaa gaaaggaaag accaagaaaa aggaagagaa acctaatggg aagatacctg 480 atcatgatcc agcccccaat gtgactgtcc tccttcgaga accagtgcgg gctcctgctg 540 tggctgtggc tccaacccca gtgcagcccc ccattatcgt tgctcctgtc gccacagttc 600 cagccatgcc ccaggagaag ctggcctcct cccccaagga caaaaagaag aaggagaaaa 660 aagtggcaaa agtggaacca gctgtcagct ctgtagtgaa ttccatccag gttctcactt 720 cgaaggctgc catcttggaa actgctccca aggaggtgcc gatggtggtg gtgcccccag 780 tgggtgccaa gggcaacaca ccagccactg gcactactca gggcaaaaag gcggagggga 840 ctcagaatca aagcaaaaag gctgaaggag ccccaaacca gggcagaaag gcagagggaa 900 ccccaaacca gggcaaaaag acagagggaa ccccaaacca agggaaaaag gcagagggaa 960 ccccaaacca aggcaaaaag gcagaaggaa ccccaaacca aggcaaaaag gcggaggggg 1020 cccagaacca gggtaaaaag gtagatacaa ccccaaacca ggggaaaaag gtggaggggg 1080 ccccaaccca gggcagaaag gccgaggggg ctcagaacca ggccaaaaag gtagaagggg 1140 cccagaacca gggcaaaaag gcagaggggg cccagaatca gggcaaaaag ggagaggggg 1200 cccagaacca gggcaagaag gccgaggggg cccagaatca gggcaagaag gccgaggggg 1260 cccagaatca gggcaagaag gccgaggggg cccagaatca gggcaagaag gccgaggggg 1320 cccagaatca gggcaagaag gctgaggggg ctcagaacca gggcaaaaag gccgaggggg 1380 ctcagaacca gggcaaaaaa gtagaagggg cccagaacca gggcaagaag gctgagggtg 1440 cccagaacca gggcaaaaag gccgaggggg cccagaatca gggcaaaaag gccgaggggg 1500 cccagaacca gggcaagaag gcagaggggg cccagaacca gggcaagaag gccgaggggg 1560 cccagaacca ggacaagaag gccgaggggg cccagaacca gggcaggaag gccgaggggg 1620 cccagaacca gggcaggaag gccgaggggg cccagaacca gggcaagaag gccgaggggg 1680 cccagaacca gggcaagaag gccgaggggg ccccgaacca gggcaagaag gccgagggga 1740 ccccgaacca gggcaagaag gccgagggga ccccgaacca gggcaagaag gccgagggga 1800 ccccgaacca gggcaagaag gccgagggga ccccgaacca gggcaagaag gccgaggggg 1860 cccagaacca gggcaagaag gccgaggggg cccagaacca gggcaagaag gccgagggga 1920 ccccgaacca gggcaagaag gccgaggggg cccagaacca gggcaagaag gccgaggggg 1980 cccagaacca gggcaagaag gccgaggggg cccagaacca gggcaagaag gccgaggggg 2040 cccagaacca gggcaagaag gccgaggggg cccagaacca gggcaagaag gccgaggggg 2100 cccagaacca gggcaagaag gccgagggtg ctcagaacca gggcaaaaaa gtagaagggg 2160 cccagaacca gggcaagaag gctgaggggg cccagaacca gggcaagaag gccgaggggg 2220 ctcagaacca gggcaaaaag gccgagggag cccagaacca gggccaaaaa ggagagggag 2280 cccagaatca gggtaaaaag acagaagggg ctcagggcaa aaaggcagaa aggagtccca 2340 accaaggcaa aaaaggagag ggagctccca tccagggcaa aaaggcagat tcggttgcta 2400 atcagggcac aaaggtagag ggtattacaa accaggggaa aaaagcagaa gggtccccca 2460 gtgaaggcaa aaaggcagaa gggtccccca accaaggcaa aaaggcagac gcagctgcca 2520 atcagggtaa aaagacagag tcagcttctg tccagggcag aaatacagat gtggcccaga 2580 gcccagaggc accaaagcaa gaggctcctg ccaagaagaa gtctggttca aagaaaaaag 2640 gtgagcctgg gcccccagat gccgacggcc ctctctacct cccctacaag acgctggtct 2700 ccacggttgg gagcatggtg ttcaacgagg gcgaggccca gcggctcatc gagatcctgt 2760 ctgagaaggc tggcatcatt caggacacct ggcacaaggc cactcagaag ggtgaccctg 2820 tggcgattct gaaacgccag ctggaagaga aggaaaaact gctggccaca gaacaggaag 2880 atgcggctgt cgccaagagc aaactgaggg agctcaacaa ggagatggca gcagaaaagg 2940 ccaaagcagc agccggggag gccaaagtga aaaagcagct ggtggcccgg gagcaggaga 3000 tcacggctgt gcaggcacgc atgcaggcca gctaccggga gcacgtgaag gaggtgcagc 3060 agctgcaggg caagatccgg actcttcagg agcagctgga gaatggcccc aacacgcagc 3120 tggcccgcct gcagcaggag aactccatcc tgcgggatgc cttgaaccag gccacgagcc 3180 aggtggagag caagcagaac gcagagctgg ccaagcttcg gcaggagctc agcaaggtca 3240 gcaaagagct ggtggagaag tcagaggctg tgcggcaaga tgagcagcag cggaaagctc 3300 tggaagccaa ggcagctgcc ttcgagaagc aggtcctgca gctgcaggcg tcccacaggg 3360 agagtgagga ggccctgcag aagcgcctgg acgaggtcag ccgggagctg tgccacacgc 3420 agagcagcca cgccagcctc cgggcggatg ccgagaaggc ccaggagcaa cagcagcaga 3480 tggccgagct gcacagcaag ttacagtcct ccgaggcgga ggtgcgcagc aaatgcgagg 3540 agctgagtgg cctccacggg cagctccagg aggccagggc ggagaactcc cagctcacag 3600 agagaatccg ttccattgag gccctgctgg aggcgggcca ggcgcgggat gcccaggacg 3660 tccaggccag ccaggcggag gctgaccagc agcagactcg cctcaaggag ctggagtccc 3720 aggtgtcggg tctggagaag gaggccatcg agctcaggga ggccgtcgag cagcagaaag 3780 tgaagaacaa tgacctccgg gagaagaact ggaaggccat ggaggcactg gccacggccg 3840 agcaggcctg caaggagaag ctgcactccc tgacccaggc caaggaggaa tcggagaagc 3900 agctctgtct gattgaggcg cagaccatgg aggccctgct ggctctgctc ccagaactct 3960 ctgtcttggc acaacagaat tacaccgagt ggctgcagga tctcaaagag aaaggcccca 4020 cgctgctgaa gcacccgcca gctcccgcgg agccctcctc ggacctggcc tccaagttga 4080 gggaggccga ggagacgcag agcacactgc aggccgagtg tgaccagtac cgcagcatcc 4140 tggcggagac ggagggcatg ctcagagacc tgcagaagag cgtggaggag gaggagcagg 4200 tgtggagggc caaggtgggc gccgcagagg aggagctcca gaagtcccgg gtcacagtga 4260 agcatctcga agagattgta gagaagctaa aaggagaact tgaaagttcg gaccaggtga 4320 gggagcacac gttgcatttg gaggcagagc tggaaaagca catggcggcc gc 4372 324 2570 DNA Homo sapiens 324 gctgtcctga gcctgagtac tctagctgcc ttgtcgtcat cgcatctggc tgccatccag 60 cgccagcaca cagtaatgag tggccgagct tcctctggga gggaggaaac agttaaaatc 120 ttgcagcagc tgcaatcatc taggcgtggt tctcttgtct gacttgggct gcacagatcc 180 tgggccaagg gacagaagaa agacagccta ggagcagagc ctcccagatg gctgagttgg 240 atctaatggc tccagggcca ctgcccaggg ccactgctca gcccccagcc cctctcagcc 300 cagactctgg gtcacccagc ccagattctg ggtcagccag cccagtggaa gaagaggacg 360 tgggctcctc ggagaagctt ggcagggaga cggaggaaca ggacagcgac tctgcagagc 420 agggggatcc tgctggtgag gggaaagagg tcctgtgtga cttctgcctt gatgacacca 480 gaagagtgaa ggcagtgaag tcctgtctaa cctgcatggt gaattactgt gaagagcact 540 tgcagccgca tcaggtgaac atcaaactgc aaagccacct gctgaccgag ccagtgaagg 600 accacaactg gcgatactgc cctgcccacc acagcccact gtctgccttc tgctgccctg 660 atcagcagtg catctgccag gactgttgcc aggagcacag tggccacacc atagtctccc 720 tggatgcagc ccgcagggac aaggaggctg aactccagtg cacccagtta gacttggagc 780 ggaaactcaa gttgaatgaa aatgccatct ccaggctcca ggctaaccaa aagtctgttc 840 tggtgtcggt gtcagaggtc aaagcggtgg ctgaaatgca gtttggggaa ctccttgctg 900 ctgtgaggaa ggcccaggcc aatgtgatgc tcttcttaga ggagaaggag caagctgcgc 960 tgagccaggc caacggtatc aaggcccacc tggagtacag gagtgccgag atggagaaga 1020 gcaagcagga gctggagagg atggcggcca tcagcaacac tgtccagttc ttggaggagt 1080 actgcaagtt taagaacact gaagacatca ccttccctag tgtttacgta gggctgaagg 1140 ataaactctc gggcatccgc aaagttatca cggaatccac tgtacactta atccagttgc 1200 tggagaacta taagaaaaag ctccaggagt tttccaagga agaggagtat gacatcagaa 1260 ctcaagtgtc tgccgttgtt cagcgcaaat attggacttc caaacctgag cccagcacca 1320 gggaacagtt cctccaatat gcgtatgaca tcacgtttga cccggacaca gcacacaagt 1380 atctccggct gcaggaggag aaccgcaagg tcaccaacac cacgccctgg gagcatccct 1440 acccggacct ccccagcagg ttcctgcact ggcggcaggt gctgtcccag cagagtctgt 1500 acctgcacag gtactatttt gaggtggaga tcttcggggc aggcacctat gttggcctga 1560 cctgcaaagg catcgaccgg aaaggggagg agcgcaacag ttgcatttcc ggaaacaact 1620 tctcctggag cctccaatgg aacgggaagg agttcacggc ctggtacagt gacatggaga 1680 ccccactcaa agctggccct ttccggaggc tcggggtcta tatcgacttc ccgggaggga 1740 tcctttcctt ctatggcgta gagtatgata ccatgactct ggttcacaag tttgcctgca 1800 aattttcaga accagtctat gctgccttct ggctttccaa gaaggaaaac gccatccgga 1860 ttgtagatct gggagaggaa cccgagaagc cagcaccgtc cttggtgggg actgctccct 1920 agactccagg agccatatcc cagacctttg ccagctacag tgatgggatt tgcattttag 1980 ggtgatttgg gggcagaaat aactgctgat ggtagctggc ttttgaaatc ctatggggtc 2040 tctgaatgaa aacattctcc agctgctctc ttttgctcca tatggtgctg ttctctatgt 2100 gtttgcagta attctttttt ttttttttga gacggagtct cgcactgttg cccaggctgg 2160 agagcagtgg cgcgatcttg gctcactgca agctccgcct cccgagttca agcaattctc 2220 ctgcctcagc ctcccgagta gctgggatta caggtgcctg ccaccacacc cagctaatgt 2280 tttgtatttt tagtagagat ggggtttcac catgttggcc aggcagatct caaactcctg 2340 acctcgtgat gcacccacct cggcctccca aagtgctggg attacatgcg tgagccactg 2400 cgccctgcct gtttgtagta atttttaggc accaaatctc cctcatcttc tagtgccatt 2460 ctcctctctg ttcaggtaaa tgtcacactg tgcccagaat ggatgaccag gaaccttaaa 2520 gagtggctga aaagattgca gagttatcat aataaattgc taacttgcgt 2570 325 4082 DNA Homo sapiens 325 cgccattttc gagtgaagga cccggagccg aaacaccggt aggagcgggg aggtgggtac 60 tacacaaccg tctccagcct tggtctgagt ggactgtcct gcagcgacca tgccccgtaa 120 aggcacccag ccctccactg cccggcgcag agaggaaggg ccgccgccgc cgtcccctga 180 cggcgccagc agcgacgcgg agcctgagcc gccgtccggc cgcacggaga gcccagccac 240 cgccgcagag actgcaagtg aggaacttga taatagaagt ttagaagaga ttttgaacag 300 cattcctcct cccccgcctc cagcaatgac caatgaagct ggagctcctc ggcttatgat 360 aactcatatt gtaaaccaga acttcaaatc ctatgctggg gagaaaattc tgggaccttt 420 ccataagcgc ttttcctgta ttatcgggcc aaatggcagt ggcaaatcca atgttattga 480 ttctatgctt tttgtgtttg gctatcgagc acaaaaaata agatctaaaa aactctcagt 540 attaatacat aattctgatg aacacaagga cattcagagt tgtacagtag aagttcattt 600 tcaaaagata attgataagg aaggggatga ttatgaagtc attcctaaca gtaatttcta 660 tgtatccaga acggcctgca gagataatac ttctgtctat cacataagtg gaaagaaaaa 720 gacattaagg atgttggaat tcttcttcga agccatggaa ttgacttgga ccataataga 780 tttttaattt tacagggtga agttgaacaa attgctatga tgaaaccaaa aggccagact 840 gaacacgatg agggtatgct tgaatattta gaagatataa ttggttgtgg acggctaaat 900 gaacctatta aagtcttgtg tcggagagtt gaaatattaa atgaacacag aggagagaag 960 ttaaacaggg taaagatggt ggaaaaggaa aaggatgcct tagaaggaga gaaaaacata 1020 gctatcgaat ttcttacctt ggaaaatgaa atatttagaa aaaagaatca tgtttgtcaa 1080 tattatattt atgagttgca gaaacgaatt gctgaaatgg aaactcaaaa ggaaaaaatt 1140 catgaagata ccaaagaaat taatgagaag agcaatatac tatcaaatga aatgaaagct 1200 aagaataaag atgtaaaaga tacagaaaag aaactgaata aaattacaaa atttattgag 1260 gagaataaag aaaaatttac acagctagat ttggaagatg ttcaagttag agaaaagtta 1320 aaacatgcca cgagtaaagc caaaaaactg gagaaacaac ttcaaaaaga taaagaaaag 1380 gttgaagaat ttaaaagtat acctgccaag agtaacaata tcattaatga aacaacaacc 1440 agaaacaatg ccctcgagaa ggaaaaagag aaagaagaaa aaaaattaaa ggaagttatg 1500 gatagcctta aacaggaaac acaagggctt cagaaagaaa aagaaagtcg agagaaagaa 1560 cttatgggtt tcagcaaatc ggtaaatgaa gcacgttcaa agatggatgt agcccagtca 1620 gaacttgata tctatctcag tcgtcataat actgcagtgt ctcaattaac taaggctaag 1680 gaagctctaa ttgcagcttc tgagactctc aaagaaagga aagctgcaat cagagatata 1740 gaaggaaaac tccctcaaac tgaacaagaa ttaaaggaga aagaaaaaga acttcaaaaa 1800 cttacacaag aagaaacaaa ctttaaaagt ttggttcatg atctctttca aaaagttgaa 1860 gaagcaaaga gctcattagc aatgaatcga agtaggggga aagtccttga tgcaataatt 1920 caagaaaaaa aatctggcag gattccagga atatatggaa gattggggga cttaggagcc 1980 attgatgaaa aatacgacgt ggctatatca tcctgttgtc atgcactgga ctacattgtt 2040 gttgattcta ttgatatagc ccaagaatgt gtaaacttcc ttaaaagaca aaatattgga 2100 gttgcaacct ttataggttt agataagatg gctgtatggg cgaaaaagat gaccgaaatt 2160 caaactcctg aaaatactcc tcgtttattt gatttagtaa aagtaaaaga tgagaaaatt 2220 cgccaagctt tttattttgc tttacgagat accttagtag ctgacaactt ggatcaagcc 2280 acaagagtag catatcaaaa agatagaaga tggagagtgg taactttaca gggacaaatc 2340 atagaacagt caggtacaat gactggtggt ggaagcaaag taatgaaagg aagaatgggt 2400 tcctcacttg ttattgaaat ctctgaagaa gaggtaaaca aaatggaatc acagttgcaa 2460 aacgactcta aaaaagcaat gcaaatccaa gaacagaaag tacaacttga agaaagagta 2520 gttaagttac ggcatagtga acgagaaatg aggaacacac tagaaaaatt tactgcaagc 2580 atccagcgtt taatagagca agaagaatat ttgaatgtcc aagttaagga acttgaagct 2640 aatgtacttg ctacagcccc tgacaaaaaa aagcagaaat tgctagaaga aaacgttagt 2700 gctttcaaaa cagaatatga tgctgtggct gagaaagctg gtaaagtaga agctgaggtt 2760 aaacgcttac acaataccat cgtagaaatc aataatcata aactcaaggc ccaacaagac 2820 aaacttgata aaataaataa gcaattagat gaatgtgctt ctgctattac taaagcccaa 2880 gtagcaatca agactgctga cagaaacctt caaaaggcac aagactctgt cttgcgtaca 2940 gagaaagaaa taaaagatac tgagaaagag gtggatgacc taacagcaga gctgaaaagt 3000 cttgaggaca aagcagcaga ggtcgtaaag aatacaaatg ctgcagagga atccttacca 3060 gagatccaga aagaacatcg caatctgctt caagaattaa aagttattca agaaaatgaa 3120 catgctcttc aaaaagatgc acttagtatt aagttgaaac ttgaacaaat agatggtcac 3180 attgctgaac ataattctaa aataaaatat tggcacaaag agatttcaaa aatatcactg 3240 catcctatag aagataatcc tattgaagag atttcggttc taagcccaga ggatcttgaa 3300 gcgatcaaga atccagattc tataacaaat caaattgcac ttttggaagc ccggtgtcat 3360 gaaatgaaac caaacctcgg tgccatcgca gagtataaaa agaaggaaga attgtatttg 3420 caacgggtag cagaattgga caaaattact tatgaaagag acagttttag acaggcatat 3480 gaagatcttc ggaaacaaag gcttaatgaa tttatggcag gtttttatat aataacaaat 3540 aaattaaagg aaaattacca aatgcttact ttgggagggg acgccgaact cgagcttgta 3600 gacagcttgg atcctttctc tgaaggaatc atgttcagtg ttcgaccacc taagaaaagt 3660 tggaaaaaga tcttcaacct ttcgggagga gagaaaacac ttagttcatt ggctttagta 3720 tttgctcttc accactacaa gcccactccc ctttacttca tggatgagat tgatgcagcc 3780 cttgatttta aaaatgtgtc cattgttgca ttttatatat atgaacaaac aaaaaatgca 3840 cagttcataa taatttctct tcgaaataat atgtttgaga tttcggatag acttattgga 3900 atttacaaga catacaacat aacaaaaagt gttgctgtaa atccaaaaga aattgcatct 3960 aagggacttt gttgaacttt atgctgaaga ttcttcaagt tgattcagtg tattactgat 4020 ttttttctat ttgtaaagga ttatgagttg tataaaatac atactcccta aactagatca 4080 tg 4082 326 4701 DNA Homo sapiens 326 gcggccgcca cacccagcac cacaggcacc aagtccaaca cgcccacatc ctccgtgccc 60 tcggccgccg tcacacccct caacgagagc ctgcagcccc tgggggacta tggcgtgggc 120 tccaagaaca gcaagcgtgc ccgggagaag cgcgacagcc gcaacatgca agtacaggtc 180 acccaggaga tgcgcaacgt cagtataggc atgggcagca gtgacgagtg gtctgatgtt 240 caagacatta ttgactccac gccagagctg gacatgtgtc cagagacccg cctggaccgc 300 acaggaagca gcccaaccca gggcatcgtg aacaaagctt tcggcatcaa caccgactcc 360 ctgtaccatg agctgtcgac ggcagggtct gaggtcatcg gggatgtgga cgaaggggcc 420 gacctcctag gggagttctc aggaatgggc aaagaagtgg ggaatctgct actggaaaac 480 tcacagcttc tggaaaccaa aaacgccttg aatgtggtga agaatgacct gattgccaag 540 gtcgaccagc tgtccgggga gcaggaggtg ctgaggggcg agttggaggc tgctaagcag 600 gccaaagtca agctggaaaa ccgtatcaag gagctggaag aggaactgaa aagagtgaag 660 tccgaggcca tcatcgcccg ccgtgaaccc aaagaagagg cggaggatgt aagcagctat 720 ctctgtacag aatcggacaa aatccccatg gcccagcgcc gccgcttcac gcgggtggag 780 atggcccgtg tgctcatgga gcggaaccag tacaaggagc ggctgatgga gctgcaggag 840 gctgtgcggt ggactgagat gatcagagcg tcccgagagc acccatccgt ccaggagaag 900 aagaagtcga ccatctggca gttcttcagc cgcctcttca gctcttcctc cagcccccct 960 ccggccaagc gcccctatcc ctcggtgaac atccactaca agtcacccac cactgccggc 1020 ttcagccagc gccgcaacca tgccatgtgc ccgatctcgg caggcagccg gcccctggaa 1080 ttcttccctg acgacgactg cacgtcctcc gcccgtcgag agcagaagcg cgagcagtac 1140 cgccaggtgc gtgagcacgt gcgtaacgac gacggccgtc tgcaggcctg cggctggagc 1200 ctgcccgcca agtacaagca gctgagtccc aacgggggcc aggaggacac gcggatgaag 1260 aacgtgccgg tgccggtgta ctgccgccct ctggtggaga aggaccccac catgaagctg 1320 tggtgtgccg cgggcgtcaa cctgagcggg tggaggccca atgaggacga cgctgggaat 1380 ggagtcaagc cagcgccagg ccgcgatccc ctgacctgcg accgcgaagg agacggcgag 1440 cccaagagcg cccacgcgtc tcccgagaag aagaaggcca aggagctccc tgaaatggac 1500 gccacctcca gccgggtgtg gatcctgacc agcaccctga ccaccagcaa ggtggtgatc 1560 atcgacgcca accagccggg cacggtggtg gaccagttca ccgtctgcaa cgcgcacgtg 1620 ctgtgcatct ccagcatccc cgcggccagc gacagcgact accctcccgg ggagatgttc 1680 ctggacagcg acgtgaaccc agaggacccg ggcgcagatg gcgtgctggc cggtatcacc 1740 ctggtgggct gtgccacccg ctgcaacgtg ccgcggagca actgctcctc ccgaggggac 1800 accccagtgc tagacaaggg gcagggggag gtggccacca tcgccaacgg gaaggtcaac 1860 ccgtcccagt ccacagagga ggccacagag gccacggagg tgccagaccc tgggcccagc 1920 gagccagaga cagccacatt gcggcccggg cctctcacag agcacgtctt cactgaccca 1980 gccccgaccc cgtcctctgg cccccagcct ggcagcgaga acgggccaga gcctgacagc 2040 agcagcacac ggccagagcc agagcccagc ggggacccca cgggagcagg cagcagtgct 2100 gcacccacca tgtggctggg agcccagaac ggctggctct atgtgcactc ggctgtggcc 2160 aactggaaga agtgcctgca ctccatcaag ctgaaggatt ctgtgctgag cctggtgcat 2220 gtcaaaggcc gtgtgctggt ggctctggcg gacgggaccc tggccatctt ccaccgtggt 2280 gaagatggcc agtgggatct gagcaactat cacctaatgg acctgggcca cccgcaccac 2340 tccatccgct gcatggctgt tgtgtacgac cgcgtgtggt gtggctacaa gaacaaggtg 2400 cacgtcatcc agcccaagac catgcagata gagaagtcat ttgacgccca cccgcggcgg 2460 gagagccagg tgcggcagct ggcgtggatc ggcgatggcg tatgggtgtc catccgcctg 2520 gactccaccc tgaggctcta ccatgcacac acgcaccagc atctacagga cgtggacatt 2580 gagccctacg tcagcaagat gctaggcact ggcaagctgg gtttctcctt cgtacgcatc 2640 acggccctgc ttgtcgcggg cagccggctc tgggtgggca ccggcaacgg agtggtcatc 2700 tccatccccc tgacagagac tgtggtcctg caccgaggcc agctcctggg gctccgagcc 2760 aataagacat cccccacctc tggggagggc gcccgtcccg ggggcatcat ccacgtgtat 2820 ggcgatgaca gcagtgacag ggcggccagc agcttcatcc cctactgctc catggcccag 2880 gcccagctat gcttccatgg gcaccgcgat gccgtgaagt tctttgtctc ggtgccaggg 2940 aacgtgctgg ccaccctgaa tggcagtgtg ctggacagcc cagccgaggg ccctgggcca 3000 gctgcccctg cctcggaggt cgagggccag aagctgcgga acgtgctggt gctgagcggc 3060 ggggagggct acatcgactt ccgcattgga gacggagagg acgacgagac ggaggagggc 3120 gcaggggaca tgagccaggt gaagcccgtg ctgtccaagg cagagcgcag tcacatcatc 3180 gtgtggcagg tgtcctacac ccccgagtga agctgctgcc ctgcctggcc cgacctgtac 3240 ataggacccc cgaccacctg acccccgccc ggcccgcggg gtagccagcc aggcgccgcc 3300 gcccctcttc taacctctca acctgcagct ttcacctgag tctggcccct ccagcgggca 3360 gggagtgcgg ggatgcggat cagctgggag gaggagggga ggggtgcttc cacccgaggg 3420 gaagatgctc tcgggacagt ttcccgggca gctcctggcc agcttccagc ccagagtcct 3480 caagtccagg gcaccttggg cccagcgcag gcagaatccg aggtggtcct ggctctaccc 3540 tgggcctcct actccccagc acccctggag gaggcagggg ctccccgccg ccgaggctgc 3600 ctgccctggg cccacctctg catgctgctc atggggccac cctgcctcct gggccctcac 3660 tctgcctagg ggagctgggc caggcactag cctttgccca gggaggtggg cctcaggctg 3720 cccaggtgcc tgcaccccag ccggccttct ctggggcctc cccgtcgtca agcctctatc 3780 ctgtctgtcc ccaccccagc tgtcccctgc ccagggagct ggcataaaag cacgaggccc 3840 ggctccctgg ggcagctgct tgagaacaga gactgctacc ccatcctgcc catgcaggca 3900 ggctcttgcc agccccgttc tgacccgtgt ccccccaggc tctgcctggg cagaagactc 3960 accttggagg agtgggccct ggagtcctgt ccctcccaga agcccccagg gtgggatttc 4020 tcaggctgcc agggcaggcc caggcctcag gaagaagggg aggcccctgg cctctccggg 4080 atcagtccta ggacacaggc tcagcctcag gttgatgggg gatgatgtgc tcccggggcc 4140 tgcctcctgc acggggctcc acggagccca gctcccagac acgctactaa gtgcctaggg 4200 ttgcccgctg tggcctgctc ccagggagca acagagaggc caccaagcag aggcccgtgg 4260 ggctgaggat ggagccgccc ccagccgact ccaagcccgc agagggcaga cgccaccctg 4320 gactgctctc cctgcccagc tgggcctctc tggcctattc ctaccttcca ggcccactgc 4380 actcctgtct gggaggccct tatgagggca gcccagcccc cgcacccacc cccaaccaga 4440 gaagcacaga tcttggggag ctgccccaca agccccgctg gccaccgagg gctgcagccg 4500 ctgcgctgcc ggcttctccc caccaccctg ccacctccac tgtgatgtat gtccgctccc 4560 tcgtctgttc ccccaggatc tcgaagtgac tccgggctga gcagtggggc ggctggggga 4620 ggggtgacga ttctcctcag gctttggccc tgcaagcaaa cccacatatc tgctctgtat 4680 gtaataaatg tcttaacgtc g 4701 327 1039 DNA Homo sapiens 327 tgcctgtctt ttccgtgcta cctgcagagg ggtccatacg gcgttgttct ggattcccgt 60 cgtaacttaa agggaaactt tcacaatgtc cggagccctt gatgtcctgc aaatgaagga 120 ggaggatgtc cttaagttcc ttgcagcagg aacccactta ggtggcacca atcttgactt 180 yccagatggaa cagtacatct ataaaaggaa aagtgatggc atctatatca taaatctcaa 240 gaggacctgg gagaagcttc tgctggcagc tcgtgcaatt gttgccattg aaaaccctgc 300 tgatgtcagt gttatatcct ccaggaatac tggccagagg gctgtgctga agtttgctgc 360 tgccactgga gccactccaa ttgctggccg cttcactcct ggaaccttca ctaaccagat 420 ccaggcagcc ttccgggagc cacggcttct tgtggttact gaccccaggg ctgaccacca 480 gcctctcacg gaggcatctt atgttaacct acctaccatt gcgctgtgta acacagattc 540 tcctctgcgc tatgtggaca ttgccatccc atgcaacaac aagggagctc actcagtggg 600 tttaatgtgg tggatgctgg ctcgggaagt tctgcgcatg cgtggcacca tttcccgtga 660 acacccatgg gaggtcatgc ctgatctgta cttctacaga gatcctgaag agattgaaaa 720 agaagagcag gctgctgctg agaaggcagt gaccaaggag gaatttcagg gtgaatggac 780 tgctcccgct cctgagttca ctgctactca gcctgaggtt gcagactggt ctgaaggtgt 840 acaggtgccc tctgtgccta ttcagcaatt ccctactgaa gactggagcg ctcagcctgc 900 cacggaagac tggtctgcag ctcccactgc tcaggccact gaatgggtag gagcaaccac 960 tgactggtct taagctgttc ttgcataggc tcttaagcag catggaaaaa tggttgatgg 1020 aaaataaaca tcagtttct 1039 328 1430 DNA Homo sapiens 328 aggtgagaga ggatgtgtgc tgggccttgg aggaaggggg ccgagaccgg gccttacttc 60 tgtaacgata ctgtgaggca tcggaaggcc agcctgttgt gtccgttttg aaggatgccc 120 ctgtcccgct ggttgagatc tgtgggggtc ttcctgctgc cagcccccta ctgggcaccc 180 cgggagaggt ggctgggttc cctacggcgg ccctccctgg tgcacgggta cccagtcctg 240 gcctggcaca gtgcccgctg ctggtgccaa gcgtggacag aggaacctcg agccctttgc 300 tcctccctca gaatgaacgg agaccagaat tcagatgttt atgcccaaga aaagcaggat 360 ttcgttcagc acttctccca gatcgttagg gtgctgactg aggatgagat ggggcaccca 420 gagataggag atgctattgc ccggctcaag gaggtcctgg agtacaatgc cattggaggc 480 aagtataacc ggggtttgac ggtggtagta gcattccggg agctggtgga gccaaggaaa 540 caggatgctg atagtctcca gcgggcctgg actgtgggct ggtgtgtgga actgctgcaa 600 gctttcttcc tggtggcaga tgacatcatg gattcatccc ttacccgccg gggacagatc 660 tgctggtatc agaagccggg cgtgggtttg gatgccatca atgatgctaa cctcctggaa 720 gcatgtatct accgcctgct gaagctctat tgccgggagc agccctatta cctgaacctg 780 atcgagctct tcctgcagag ttcctatcag actgagattg ggcagaccct ggacctcctc 840 acagcccccc agggcaatgt ggatcttgtc agattcactg aaaagaggta caaatctatt 900 gtcaagtaca agacagcttt ctactccttc taccttccta tagctgcagc catgtacatg 960 gcaggaattg atggcgagaa ggagcacgcc aatgccaaga agatcctgct ggagatgggg 1020 gagttctttc agattcagga tgattacctt gacctctttg gggaccccag tgtgaccggc 1080 aaaattggca ctgacatcca ggacaacaaa tgcagctggc tggtggttca gtgtctgcaa 1140 cgggccactc cagaacagta ccagatcctg aaggaaaatt acgggcagaa ggaggctgag 1200 aaagtggccc gggtgaaggc gctatatgag gagctggatc tgccagcagt gttcttgcaa 1260 tatgaggaag acagttacag ccacattatg gctctcattg aacagtacgc agcacccctg 1320 cccccagccg tctttctggg gcttgcgcgc aaaatctaca agcggagaaa gtgacctaga 1380 gattgcaagg gcggggagag gaggctctca ataaataatc gtgtaacctt 1430 329 3047 DNA Homo sapiens misc_feature (1)...(3047) n = A,T,C or G 329 aggcagggcg ggcgggcgct ctaagggttc tgctctgact ccaggttggg acagcgtctt 60 cgctgctgct ggatagtcgt gttttcgggg atcgaggata ctcaccagaa accgaaaatg 120 ccgaaaccaa tcaatgtccg agttaccacc atggatgcag agctggagtt tgcaatccag 180 ccaaatacaa ctggaaaaca gctttttgat caggtggtaa agactatcgg cctccgggaa 240 gtgtggtact ttggcctcca ctatgtggat aataaaggat ttcctacctg gctgaagctg 300 gataagaagg tgtctgccca ggaggtcagg aaggagaatc ccctccagtt caagttccgg 360 gccaagttct accctgaaga tgtggctgag gagctcatcc aggacatcac ccagaaactt 420 ttcttcctcc aagtgaagga aggaatcctt agcgatgaga tctactgccc ccctgagact 480 gccgtgctct tggggtccta cgctgtgcag gccaagtttg gggactacaa caaagaagtg 540 cacaagtctg ggtacctcag ctctgagcgg ctgatccctc aaagagtgat ggaccagcac 600 aaacttacca gggaccagtg ggaggaccgg atccaggtgt ggcatgcgga acaccgtggg 660 atgctcaaag ataatgctat gttggaatac ctgaagattg ctcaggacct ggaaatgtat 720 ggaatcaact atttcgagat aaaaaacaag aaaggaacag acctttggct tggagttgat 780 gcccttggac tgaatattta tgagaaagat gataagttaa ccccaaagat tggctttcct 840 tggagtgaaa tcaggaacat ctctttcaat gacaaaaagt ttgtcattaa acccatcgac 900 aagaaggcac ctgactttgt gttttatgcc ccacgtctga gaatcaacaa gcggatcctg 960 cagctctgca tgggcaacca tgagttgtat atgcgccgca ggaagcctga caccatcgag 1020 gtgcagcaga tgaaggccca ggcccgggag gagaagcatc agaagcagct ggagcggcaa 1080 cagctggaaa cagagaagaa aaggagagaa accgtggaga gagagaaaga gcagatgatg 1140 cgcgagaagg aggagttgat gctgcggctg caggactatg aggagaagac aaagaaggca 1200 gagagagagc tctcggagca gattcagagg gccctgcagc tggaggagga gaggaagcgg 1260 gcacaggagg aggccgagcg cctagaggct gaccgtatgg ctgcactgcg ggctaaggag 1320 gagctggaga gacaggcggt ggatcagata aagagccagg agcagctggc tgcggagctt 1380 gcagaataca cagccaagat tgccctcctg gaagaggcgc ggaggcgcaa ggaggatgaa 1440 gttgaagagt ggcagcacag ggccaaagaa gcccaggatg acctggtgaa gaccaaggag 1500 gagctgcacc tggtgatgac agcacccccg cccccaccac cccccgtgta cgagccggtg 1560 agctaccatg tccaggagag cttgcaggat gagggcgcag agcccacggg ctacagcgcg 1620 gagctgtcta gtgagggcat ccgggatgac cgcaatgagg agaagcgcat cactgaggca 1680 gagaagaacg agcgtgtgca gcggcagctc gtgacgctga gcagcgagct gtcccaggcc 1740 cgagatgaga ataagaggac ccacaatgac atcatccaca acgagaacat gaggcaaggc 1800 cgggacaagt acaagacgct gcggcagatc cggcagggca acaccaagca gcgcatcgac 1860 gagttcgagg ccctgtaaca gccaggccag gaccaagggc agaggggtgc tcatagcggg 1920 cgctgccagc cccgccacgc ttgtctttag tgctccaagt ctaggaactc cctcagatcc 1980 cagttccttt agaaagcagt tacccaacag aaacattctg ggctgggaac cagggaggcg 2040 ccctggtttg ttttccccag ttgtaatagt gccaagcagg cctgattctc gcgattattc 2100 tcgaatcacc tcctgtgttg tgctgggagc aggactgatt gaattacgga aaatgcctgt 2160 aaagtctgag taagaaactt catgctggcc tgtgtgatac aagagtcagc atcattaaag 2220 gaaacgtggc aggacttcca tctgtgccat acttgttctg tattcgaaat gagctcaaat 2280 tgattttttt aatttctatg aaggatccat ctttgtatat ttacatgctt agaggggtga 2340 aaattatttt ggaaattgag tctgaagcac tctcgcacac acagtgattc cctcctcccg 2400 tcactccacg cagctggcag agagcacagt gatcaccagc gtgagtggtg gaggaggaca 2460 cttggatatt tttttagttc tttttttttt ggcttaacag ttttagaata cattgtactt 2520 atacacctta ttaatgatca gctatatact atttatatac aagtgataat acagatttgt 2580 aacattagtt ttaaaaaggg aaagttttgt tctgtatatt ttgttacctt ttacagaata 2640 aaagaattac atatgaaaaa ccctctaaac catggcactt gatgtgatgt ggcaggaggg 2700 nagtggtgga gctggacctg cctgctgcag ctgcagtcac gtgtaaacag gattattatt 2760 agtgttttat gcatgtaatg gactatgcac acttttaatt ttgtcagatt cacacatgcc 2820 actatgagct ttcagactcc agctgtgaag agactctgtc tgcttgtgtt tgtttgcagt 2880 ctctctctgc catggccttg gcaggctgct ggaaggcagc ttgtggaggc cgttggttcc 2940 gcccactcat tccttctcgt gcactgcttt ctccttcaca gctaagatgc catgtgcagg 3000 tggattccat gccgcagaca tgaaataaaa gctttgcaaa ggcaaag 3047 330 1746 DNA Homo sapiens 330 tgggatcatt aaagctgcac tcgcataatt tacaatggaa ccgaattact tttccttcat 60 catttcgtac ctcttccaca taatcatggc ccactggctg cacatcactc tgtaaagcag 120 caagagatgc aggtgtgact ggttctgaca ctgtgtcttg ctttacttca ggaatctgca 180 cagcagaagt gacaggagta cttttaacac attcggttcc ttttatgtct tctgctttat 240 ttcctgttga ctgcagctta ttaccaccaa caaaatttat tttgggggta gatgttttct 300 tggacgccat atttgtaggc actgctgata ctttagtgtt agatgtgcta ttaagagacg 360 agtttcctgt agtcgtaaga cccttcattg aagacgttgc aattgatgac gtattcacag 420 tacaattgtt tgctgcaatg cttgaaggag gcagtcggct ttgaagcaga tacagctgtt 480 gaagaagtag cttggctaac aacatttggt tctgttgatg gaatgggttt accaagtttt 540 gagagaaact taaccacttt ctgatgctta gcaccacgaa tggtcagcat acgcatctgc 600 tcctgtacaa gacacatcgc agagctcaca acgtagctga ttttgagtcc cacgagtaga 660 actgttgctg ctgctggtat tttgtgaggc tttcaatgca gcttcttttt ttttatgttt 720 ctgtccttct aaatgttctt tataagtctg tggtccagca cagctgatct acaaacatca 780 caatagtgaa tctgtggtgg tttgggaggc tgttttggtt tcagttgttt attttggaat 840 ggtgcttttt agtaaaggtg gtccctgtcc aggcagctgt tgcagcagca gcagcagctg 900 ctgctgctgc ctgcttctgt tgctgctgct gctgtgaagt aggaagatgc agctgataca 960 ctgctgcttc ataacctgaa taagacgtac cagaatatgt aactctgtgc gacttaagta 1020 gcactctgag tataggatgg caccacagat gccgcagctg ctacaggctg tacggtggag 1080 gatacaggat agatggagaa agtagtggta gctggacttg gtgtggctgg tttatggctg 1140 tcacttggtc gagtttgctg ggcttgagta tactgagttg caccttggct gtaaccttgc 1200 tttgggggct gtctgatagt aagtttcagc aacagaaggc tgaggttggg cagcggcagc 1260 tacagcagca gcagttgctg tgttggttgt tggtagtatt gcttactatc ataagctaca 1320 gcaggagctg tgaccttaca tatgagtatg aatcctggta gttttgtgta gtagctgggg 1380 gtggtggtgg tggtgcttct tgttgcctct gagtataacc atagtcagtt gctgtgtgtg 1440 cagtggggta gcctccataa gcagcagctg ttgcagcagc tgcaacagct actggagcag 1500 gcctggcaac tgcaactgtg gcggctgctg gtgcataggc agcagtaact gtgtgagcag 1560 ctactggagc ctgatggaca gtgtagctag caactgtagt tggatgagaa taggctacac 1620 ccgaagctgg ctgctggcta tattgggccg cagcgccgcc gccgccgccc cgctgtgggt 1680 gaatccaaag tagttgccgg tcgccatttt ggcatcttcc cccagccggc tgggcacata 1740 ggtgaa 1746 331 1956 DNA Homo sapiens 331 catgattccc aagcttggca cgagggtccg caagcccggc tgagagcgcg ccatggggca 60 ggcgggctgc aaggggctct gcctgtcgct gttcgactac aagaccgaga agtatgtcat 120 cgccaagaac aagaaggtgg gcctgctgta ccggctgctg caggcctcca tcctggcgta 180 cctggtcgta tgggtgttcc tgataaagaa gggttaccaa gacgtcgaca cctccctgca 240 gagtgctgtc atcaccaaag tcaagggcgt ggccttcacc aacacctcgg atcttgggca 300 gcggatctgg gatgtcgccg actacgtcat tccagcccag gaaaagaacg tcttttttgt 360 ggtcaccaac ctgattgtga cccccaacca gcggcagaac gtctgtgctg agaatgaagg 420 cattcctgat ggcgcgtgct ccaaggacag cgactgccac gctggggaag cggttacagc 480 tggaaacgga gtgaagaccg gccgctgcct gcggagagag aacttggcca ggggcacctg 540 tgagatcttt gcctggtgcc cgttggagac aagctccagg ccggaggagc cattcctgaa 600 ggaggccgaa gacttcacca ttttcataaa gaaccacatc cgtttcccca aattcaactt 660 ctccaacaat gtgatggacg tcaaggacag atctttcctg aaatcatgcc actttggccc 720 caagaaccac tactgcccca tcttccgact gggctccgtg atccgctggg ccgggagcga 780 cttccaggat atagccctgg agggtggcgt gataggaatt aatattgaat ggaactgtga 840 tcttgataaa gctgcctctg agtgccaccc tcactattct tttagccgtc tggacaataa 900 actttcaaag tctgtctcct ccgggtacaa cttcagattt gccagatatt accgagacgc 960 agccggggtg gagttccgca ccctgatgaa agcctacggg atccgctttg acgtgatggt 1020 gaacggcaag ggtgcctcct tctgcgacct ggtactcatc tacctcatca aaaagagaga 1080 gttttaccgt gacaagaagt accaggaagt gaggggccta gaagacagtt cccaggaggc 1140 cgaggacgag gcatcggggc tggggctatc tgagcagctc acatctgggc cagggctgct 1200 ggggatgccg gagcagcagg agctgcagga gccacccgag gcgaacgttg gaagcagcag 1260 tcagaagggg aacggatctg tgtgcccaca gctcctggag ccccacagga gcacgtgaat 1320 tgcctctgct tacgttcagg ccctgtccta aacccagccg tctagcaccc agtgatccca 1380 tgcctttggg aatcccagga tgctgcccaa cgggaaattt gtacattggg tgctatgaat 1440 gccacatcac agggaccagc catcacagag caaagtgacc tccacgtctg atgctggggt 1500 catcaggacg gacccatcat ggctatcttt ttgccccacc ccctgccgtc agttcttcct 1560 ttctccgtgg ctggcttccc gcactaggga acgggttgta aatggggaac atgacttcct 1620 tccggagtcc ttgagcacct cagctaagga ccgcagtgcc ctgtagagtt cctagattac 1680 ctcactggga atagcattgt gcgtgtccgg aaaagggctc catttggttc cagcccactc 1740 ccctctgcaa gtgccgcagc ttccctcagg catactctcc agtggatcca agtactctct 1800 ctcctaaaga caccaccttc ctgccagctg tttgccctta ggccagtaca cagaattgaa 1860 agtgggggag gtggcagacg ctttctggga cctgcccaag atatgtattc tctgacactc 1920 ttatttggtc ataaaacaat aaatggtgtc aatttc 1956 332 1367 DNA Homo sapiens 332 caaggccgca ccgtactggg cgggggtctg gggagcgcag cagccatggc aagccgtctc 60 ctgctcaaca acggcgccaa gatgcccatc ctggggttgg gtacctggaa gtcccctcca 120 gggcaggtga ctgaggccgt gaaggtggcc attgacgtcg ggtaccgcca catcgactgt 180 gcccatgtgt accagaatga gaatgaggtg ggggtggcca ttcaggagaa gctcagggag 240 caggtggtga agcgtgagga gctcttcatc gtcagcaagc tgtggtgcac gtaccatgag 300 aagggcctgg tgaaaggagc ctgccagaag acactcagcg acctgaagct ggactacctg 360 gacctctacc ttattcactg gccgactggc tttaagcctg ggaaggaatt tttcccattg 420 gatgagtcgg gcaatgtggt tcccagtgac accaacattc tggacacgtg ggcggccatg 480 gaagagctgg tggatgaagg gctggtgaaa gctattggca tctccaactt caaccatctc 540 caggtggaga tgatcttaaa caaacctggc ttgaagtata agcctgcagt taaccagatt 600 gagtgccacc catatctcac tcaggagaag ttaatccagt actgccagtc caaaggcatc 660 gtggtgaccg cctacagccc cctcggctct cctgacaggc cctgggccaa gcccgaggac 720 ccttctctcc tggaggatcc caggatcaag gcgatcgcag ccaagcacaa taaaactaca 780 gcccaggtcc tgatccggtt ccccatgcag aggaacttgg tggtgatccc caagtctgtg 840 acaccagaac gcattgctga gaactttaag gtctttgact ttgaactgag cagccaggat 900 atgaccacct tactcagcta caacaggaac tggagggtct gtgccttgtt gagctgtacc 960 tcccacaagg attacccctt ccatgaagag ttttgaagct gtggttgcct gctcgtcccc 1020 aagtgaccta tacctgtgtt tcttgcctca tttttttcct tgcaaatgta gtatggcctg 1080 tgtcactcag cagtgggaca gcaacctgta gagtggccag cgagggcgtg tctagcttga 1140 tgttggatct caagagccct gtcagtagag tagaagtctc ttccagtttg ctttgccctt 1200 ctttctaccc tgctggggaa agtacaacct gaataccctt ttctgaccaa agagaagcaa 1260 aatctaccag gtcaaaatag tgccactaac ggttgagttt tgactgcttg gaactggaat 1320 cctttcagca agacttctct ttgcctcaaa taaaaagtgc ttttgtg 1367 333 7995 DNA Homo sapiens 333 atgggtcagg tatagccagg ctggagaaaa gaagctgcca ccatggttgc actttcactg 60 aagatcagca ttgggaatgt ggtgaagacg atgcagtttg agccgtctac catggtgtac 120 gacgcctgcc gcatcattcg tgagcggatc ccagaggccc cagctggtcc tcccagcgac 180 tttgggctct ttctgtcaga tgatgacccc aaaaagggta tatggctgga ggctgggaaa 240 gctttggact actacatgct ccgaaatggg gacactatgg agtacaggaa gaaacagaga 300 cccctgaaga tccgtatgct ggatggaact gtgaagacga tcatggtgga tgactctaag 360 actgtcactg acatgctcat gaccatctgt gcccgcattg gcatcaccaa tcatgatgaa 420 tattcattgg ttcgagagct gatggaagag aaaaaggagg aaataacagg gaccttaaga 480 aaggacaaga cattgctgcg agatgaaaag aagatggaga aactaaagca gaaattgcac 540 acagatgatg agttgaactg gctggaccat ggtcggacac tgagggagca gggtgtagag 600 gagcacgaga cgctgctgct gcggaggaag ttcttttact cagaccagaa tgtggattcc 660 cgggaccctg tacagctgaa cctcctgtat gtgcaggcac gagatgacat cctgaatggc 720 tcccaccctg tctcctttga caaggcctgt gagtttgctg gcttccaatg ccagatccag 780 tttgggcccc acaatgagca gaagcacaag gctggcttcc ttgacctgaa ggacttcctg 840 cccaaggagt atgtgaagca gaagggagag cgtaagatct tccaggcaca caagaattgt 900 gggcagatga gtgagattga ggccaaggtc cgctacgtga agctagcccg ttctctcaag 960 acttacggtg tctccttctt cctggtgaag gaaaaaatga aagggaagaa caagctagtg 1020 cccaggcttc tgggcatcac caaggagtgt gtgatgcgag tggatgagaa gaccaaggaa 1080 gtgatccagg agtggaacct caccaacatc aaacgctggg ctgcgtctcc caaaagcttc 1140 accctggatt ttggagatta ccaagatggc tattactcag tacagacaac tgaaggggag 1200 cagattgcac agctcattgc cggctacatc gatatcatcc tgaagaagaa aaaaagcaag 1260 gatcactttg ggctggaagg agatgaggag tctactatgc tggaggactc agtgtccccc 1320 aaaaagtcaa cagtcctgca gcagcaatac aaccgggtgg ggaaagtgga gcatggctct 1380 gtggccctgc ctgccatcat gcgctctgga gcctctggtc ctgagaattt ccaggtgggc 1440 agcatgcccc ctgcccagca gcagattacc agcggccaga tgcaccgagg acacatgcct 1500 cctctgactt cagcccagca ggcactcact ggaaccatta actccagcat gcaggccgtg 1560 caggctgccc aggccaccct ggatgacttt gacactctgc cgcctcttgg ccaggatgct 1620 gcctctaagg cctggcgtaa aaacaagatg gatgaatcaa agcatgagat ccactctcag 1680 gtagatgcca tcacagctgg tactgcgtct gtggtgaacc tgacagcagg ggaccctgct 1740 gagacagact ataccgcagt gggctgtgca gtcaccacaa tctcctccaa cctgacggag 1800 atgtcccgtg gggtgaagct gctggctgcc ttgctggagg acgaaggcgg cagtggtcgg 1860 cccctgttgc aggcagcaaa gggccttgcg ggagcagtgt cagaactgct gcgcagtgcc 1920 caaccagcca gtgctgagcc ccgtcagaac ctgctgcaag cagctgggaa cgtgggccag 1980 gccagtgggg agctgttgca acaaattggg gaaagtgata ctgaccccca cttccaggat 2040 gcgctaatgc agctcgccaa agctgtggca agtgctgcag ctgccctggt cctcaaggcc 2100 aagagtgtgg cccagcggac agaggactcg ggacttcaga cccaagttat tgctgcagca 2160 acacagtgtg ccctatccac ttcccaacta gtggcctgta ctaaggtggt ggcacctaca 2220 atcagctcac ctgtctgcca agagcaactg gtggaggctg gacgactggt agccaaagcc 2280 gtggagggct gtgtgtctgc ctcccaggca gctacagagg atgggcaact gttgcgaggg 2340 gtaggagcag cagccacagc tgtcacccag gccctaaatg agctgctgca gcatgtgaaa 2400 gcccatgcca caggggctgg gcctgctggc cgttatgacc aggctactga caccatccta 2460 accgtcactg agaacatctt tagctccatg ggtgatgctg gggagatggt gcgacaggcc 2520 cgcatcctgg cccaagccac atctgacctg gtcaatgcca tcaaggctga tgctgagggg 2580 gaaagtgatc tggagaactc ccgcaagctc ttaagtgctg ccaagatcct agctgatgcc 2640 acagccaaga tggtagaggc tgccaaggga gcagctgccc accctgacag tgaggagcag 2700 cagcagcggc tgcgggaggc agctgagggg ctgcgcatgg ccaccaatgc agctgcgcag 2760 aatgccatca agaaaaagct ggtgcagcgc ctggagcatg cagccaagca ggctgcagcc 2820 tcagccacac agaccatcgc tgcagctcag cacgcagcct ctacccccaa ggcctctgcc 2880 ggcccccagc ccctgctggt gcagagctgc aaggcagtgg cagagcagat tccactgctg 2940 gtgcagggcg tccgaggaag ccaagcccag cctgacagcc ccagcgctca gcttgccctc 3000 attgctgcca gccagagctt cctgcagcca ggtgggaaga tggtggcagc tgcaaaggcc 3060 tcagtgccaa cgattcagga ccaggcttca gccatgcagc tgagtcagtg tgccaagaac 3120 ctgggcaccg cgctggctga actccggacg gctgcccaga aggctcagga agcatgtgga 3180 cctttggaga tggattctgc actgagtgtg gtacagaatc tagagaaaga tctacaggaa 3240 gtgaaggcag cagctcgaga tggcaagctt aaacccttac ctggggagac aatggagaag 3300 tgtacccagg acctgggcaa cagcaccaaa gccgtgagct cagccatcgc ccagctactg 3360 ggagaggttg cccagggcaa tgagaattat gcaggtattg cagctcggga tgtggcaggt 3420 gggctgcggt cactggccca ggccgctagg ggagtcgctg cactgacgtc agatcctgca 3480 gtgcaggcca ttgtacttga tacggccagt gatgtgctgg acaaggccag cagcctcatt 3540 gaggaggcga aaaaggcagc tggccatcca ggggaccctg agagccagca gcggcttgcc 3600 caggtggcta aagcagtgac ccaggctctg aaccgctgtg tcagctgcct acctggccag 3660 cgcgatgtgg ataatgccct gagggcagtt ggagatgcca gcaagcgact cctgagtgac 3720 tagcttcctc ctagcactgg gacatttcaa gaagctcaga gccggttgaa tgaagctgct 3780 gctgggctga atcaggcagc cacagaactg gtgcaggcct ctcggggaac ccctcaggac 3840 ctggctcgag cctcaggccg atttggacag gacttcagca ccttcctgga agctggtgtg 3900 gagatggcag gccaggctcc gagccaggag gaccgagccc aagttgtgtc caacttgaag 3960 ggcatctcca tgtcttcaag caaacttctt ctggctgcca aggccctgtc cacggaccct 4020 gctgccccta acctcaagag tcagctggct gcagctgcca gggcagtaac tgacagcatc 4080 aatcagctca tcactatgtg cacccagcag gcacccggcc agaaggagtg tgataacgcc 4140 ctgcgggaat tggagacggt ccgggaactc ctggagaacc cagtccagcc catcaatgac 4200 atgtcctact ttggttgcct ggacagtgta atggagaact caaaggtgct gggcgaggcc 4260 atgactggca tctcccaaaa tgccaagaac ggaaacctgc cagagtttgg agatgccatt 4320 tccacagcct caaaggcact ttgtggcttc accgaggcag ctgcacaggc tgcatatctg 4380 gttggtgtct ctgaccccaa tagccaagct ggacagcaag ggctagtgga gcccacacag 4440 tttgcccgtg caaaccaggc aattcagatg gcctgccaga gtttgggaga gcctggctgt 4500 acccaggccc aggtgctctc tgcagccacc attgtggcta aacacacctc tgcactgtgt 4560 aacagctgtc gcctggcttc tgcccgtacc accaatccta ctgccaagcg ccagtttgta 4620 cagtcagcca aggaggtggc caacagcaca gctaatcttg tcaagaccat caaggcgcta 4680 gatggggcct tcacagagga gaaccgtgcc cagtgccgag cagcaacagc ccctctgctg 4740 gaggctgtgg acaatctgag tgcctttgcg tccaaccctg agttctccag cattcctgcc 4800 cagatcagcc ctgagggtcg ggctgccatg gagcccattg tgatctctgc caagacaatg 4860 ttagagagtg ccgggggact catccagaca gcccgggccc tcgcagtcaa tccccgggac 4920 cccccgagct ggtcggtgct ggccggccac tcccgtactg tctcagactc catcaagaag 4980 ctaattacaa gcatgaggga caaggctcca gggcagctgg agtgtgaaac ggccattgca 5040 gctctgaaca gttgtctacg ggacctagac caggcttccc tcgctgcagt cagccagcag 5100 cttgctcccc gtgagggaat ctctcaagag gccttgcaca ctcagatgct cactgcagtc 5160 caagagatct cccatctcat tgagccgctg gccaatgctg cccgggctga agcctcccag 5220 ctgggacaca aggtgtccca gatggcgcag tactttgagc cgctcaccct ggctgcagtg 5280 ggtgctgcct ccaagaccct gagccacccg cagcagatgg cactcctgga ccagactaaa 5340 acattggcag agtctgccct gcagttgcta tacactgcca aggaggctgg tggtaaccca 5400 aagcaagcag ctcacaccca ggaagccctg gaggaggctg tgcagatgat gaccgaggcc 5460 gtagaggacc tgacaacaac cctcaacgag gcagccagtg ctgctggggt cgtgggtggc 5520 atggtggact ccatcaccca ggccatcaac cagctagatg aaggaccaat gggtgaacca 5580 gaaggttcct tcgtggatta ccaaacaact atggtgcgga cagccaaggc cattgcagtg 5640 accgttcagg agatggttac caagtcaaac accagcccag aggagctggg ccctcttgct 5700 aaccagctga ccagtgacta tggccgtctg gcctcggagg ccaagcctgc agcggtggct 5760 gctgaaaatg aagagatagg ttcccatatc aaacaccggg tacaggagct gggccatggc 5820 tgtgccgctc tggtcaccaa ggcaggcgcc ctgcagtgca gccccagtga tgcctacacc 5880 aagaaggagc tcatagagtg tgcccggaga gtctctgaga aggtctccca cgtcctggct 5940 gcgctccagg ctgggaatcg tggcacccag gcctgcatca cagcagccag cgctgtgtct 6000 ggtatcattg ctgacctcga caccaccatc atgttcgcca ctgctggcac gctcaatcgt 6060 gagggtactg aaactttcgc tgaccaccgg gagggcatcc tgaagactgc gaaggtgctg 6120 gtggaggaca ccaaggtcct ggtgcaaaac gcagctggga gccaggagaa gttggcgcag 6180 gctgcccagt cctccgtggc gaccatcacc cgcctcgctg atgtggtcaa gctgggtgca 6240 gccagcctgg gagctgagga ccctgagacc caggtggtac taatcaacgc agtgaaagat 6300 gtagccaaag ccctgggaga cctcatcagt gcaacgaagg ctgcagctgg caaagttgga 6360 gatgaccctg ctgtgtggca gctaaagaac tctgccaagg tgatggtgac caatgtgaca 6420 tcattgctta agacagtaaa agccgtggaa gatgaggcca ccaaaggcac tcgggccctg 6480 gaggcaacca cagaacacat acggcaggag ctggcggttt tctgttcccc agagccacct 6540 gccaagacct ctaccccaga agacttcatc cgaatgacca agggtatcac catggcaacc 6600 gccaaggccg ttgctgctgg caattcctgt cgccaggaag atgtcattgc cacagccaat 6660 ctgagccgcc gtgctattgc agatatgctt cgggcttgca aggaagcagc ttaccaccca 6720 gaagtggccc ctgatgtgcg gcttcgagcc ctgcactatg gccgggagtg tgccaatggc 6780 tacctggaac tgctggacca tgtactgctg accctgcaga agccaagccc agaactgaag 6840 cagcagttga caggacattc aaagcgtgtg gctggttccg tcactgagct catccaggct 6900 gctgaagcca tgaagggaac agaatgggta gacccagagg accccacagt cattgctgag 6960 aatgagctcc tgggagctgc agccgccatt gaggctgcag ccaaaaagct agagcagctg 7020 aagccccggg ccaaacccaa ggaggcagat gagtccttga actttgagga gcagatacta 7080 gaagctgcca agtccattgc agcagccacc agtgcactgg taaaggctgc gtcggctgcc 7140 cagagagaac tagtggccca agggaaggtg ggtgccattc cagccaatgc actggacgat 7200 gggcagtggt cccagggcct catttctgct gcccggatgg tggctgcggc caccaacaat 7260 ctgtgtgagg cagccaatgc agctgtacaa ggccatgcca gccaggagaa gctcatctca 7320 tcagccaagc aggtagctgc ctccacagcc cagctccttg tggcctgcaa ggtcaaggct 7380 gaccaggact cggaggcaat gaaacgactt caggctgctg gcaacgcagt gaagcgagcc 7440 tcagataatc tggtgaaagc agcacagaag gctgcagcct ttgaagagca ggagaatgag 7500 acagtggtgg tgaaagagaa gatggttggc ggcattgccc agatcatcgc agcacaggaa 7560 gaaatgcttc ggaaggaacg agagctggaa gaggcgcgga agaaactggc ccagatccgg 7620 cagcagcagt acaagtttct gccttcagag cttcgagatg agcactaaag aagcctcttc 7680 tatttaatgc agacccggcc cagagactgt gcgtgccact accaaagcct tctgggctgt 7740 cggggcccaa cctgcccaac cccagcactc cccaaagtgc ctgccaaacc ccagggcctg 7800 gccccgccca gtcccgcagt acatcccctg tcccctcccc aaccccaagt gccttcatgc 7860 cctagggccc cccaagtgcc tgcccctccc cagagtatta acgctccaag agtattatta 7920 acgctgctgt acctcgatct gaatctgccg gggccccagc ccactccacc ctgccagcag 7980 cttccggcca gtccc 7995 334 4139 DNA Homo sapiens 334 ccgctccacc tctcaagcag ccagcgcctg cctgaatctg ttctgccccc tccccaccca 60 tttcaccacc accatgacac cgggcaccca gtctcctttc ttcctgctgc tgctcctcac 120 agtgcttaca gttgttacag gttctggtca tgcaagctct accccaggtg gagaaaagga 180 gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg ctgtgagtat 240 gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca ctcagggaca 300 ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca cctggggaca 360 ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc cgccagccca 420 cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc ccccagccca 480 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 540 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 600 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 660 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 720 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 780 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 840 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 900 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 960 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1020 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1080 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1140 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1200 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1260 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1320 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1380 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1440 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1500 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1560 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1620 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1680 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1740 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1800 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1860 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1920 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1980 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2040 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2100 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2160 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2220 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2280 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2340 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2400 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2460 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2520 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2580 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2640 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2700 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2760 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2820 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2880 tggtgtcacc tcggccccgg acaacaggcc cgccttgggc tccaccgccc ctccagtcca 2940 caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg tgcacaacgg 3000 cacctctgcc agggctacca caaccccagc cagcaagagc actccattct caattcccag 3060 ccaccactct gatactccta ccacccttgc cagccatagc accaagactg atgccagtag 3120 cactcaccat agctcggtac ctcctctcac ctcctccaat cacagcactt ctccccagtt 3180 gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc agtttaattc 3240 ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca tttctgaaat 3300 gtttttgcag atttataaac aagggggttt tctgggcctc tccaatatta agttcaggcc 3360 aggatctgtg gtggtacaat tgactctggc cttccgagaa ggtaccatca atgtccacga 3420 cgtggagaca cagttcaatc agtataaaac ggaagcagcc tctcgatata acctgacgat 3480 ctcagacgtc agcgtgagtg atgtgccatt tcctttctct gcccagtctg gggctggggt 3540 gccaggctgg ggcatcgcgc tgctggtgct ggtctgtgtt ctggttgcgc tggccattgt 3600 ctatctcatt gccttggctg tctgtcagtg ccgccgaaag aactacgggc agctggacat 3660 ctttccagcc cgggatacct accatcctat gagcgagtac cccacctacc acacccatgg 3720 gcgctatgtg ccccctagca gtaccgatcg tagcccctat gagaaggttt ctgcaggtaa 3780 cggtggcagc agcctctctt acacaaaccc agcagtggca gccgcttctg ccaacttgta 3840 gggcacgtcg ccgctgagct gagtggccag ccagtgccat tccactccac tcaggttctt 3900 caggccagag cccctgcacc ctgtttgggc tggtgagctg ggagttcagg tgggctgctc 3960 acagcctcct tcagaggccc caccaatttc tcggacactt ctcagtgtgt ggaagctcat 4020 gtgggcccct gaggctcatg cctgggaagt gttgtggggg ctcccaggag gactggccca 4080 gagagccctg agatagcggg gatcctgaac tggactgaat aaaacgtggt ctcccactg 4139 335 2217 DNA Homo sapiens 335 agaggggccc cgcgcgcgga tctcgcgaga gcattagagg gcggaagcgc tatccgagca 60 ggatgcggtt cgtggttgcc ttggtcctcc tgaacgtcgc agcggcggga gccgtgccgc 120 tcttggccac cgaaagcgtc aagcaagaag aagctggagt acggccttct gcaggaaacg 180 tctccaccca ccccagcttg agccaacggc ctggaggctc taccaagtcg catccggagc 240 cgcagactcc aaaagacagc cctagcaagt cgagtgcgga ggcgcagacc ccagaagaca 300 cccccaacaa gtcgggtgcg gaggcaaaga cccaaaaaga cagctccaac aagtcgggtg 360 cggaggcaaa gacccaaaaa ggcagcacta gcaagtcggg ttcggaggcg cagaccacaa 420 aagacagcac tagtaagtcg catccggagc tgcagactcc aaaagacagc actggcaaat 480 cgggtgcgga ggcgcagacc ccagaagaca gccccaacag gtcgggtgcg gaggcaaaga 540 cccaaaaaga cagccctagc aagtcaggtt cggaggcgca gaccacaaaa gatgtcccta 600 ataagtcggg tgcggacggc cagaccccaa aagacggctc cagcaagtcg ggtgcggagg 660 atcagacccc aaaagacgtc cctaacaagt cgggtgcgga gaagcagact ccaaaagacg 720 gctctaacaa gtccggtgca gaggagcagg cccaatagac gggcccagca agtcgggtgc 780 ggaggagcag acctcaaaag acagccctaa caaggtggtt ccagagcagc cttcctggaa 840 agaccattcc aagcccatct ccaacccttc tgataacaag gagctcccca aggctgacac 900 aaaccagctt gctgacaaag ggaagctttc tcctcatgct ttcaaaaccg aatctgggga 960 ggaaactgac ctcatttctc ccccgcagga ggaagttaag tcttcagagc ctactgagga 1020 tgtggagccc aaagaggctg aagatgatga tacaggaccc gaggagggct caccgcccaa 1080 agaagagaaa gaaaagatgt ccggttctgc ctccagtgag aaccgtgaag gaacactttc 1140 ggattccacg ggtagcgaga aggatgacct ttatccgaac ggttctggaa atggcagcgc 1200 ggagagcagc cacttctttg catatctggt gactgcagcc attcttgtgg ctgtcctcta 1260 tatcgctcat cacaacaagc ggaagatcat tgcttttgtc ctggaaggaa aaagatctaa 1320 agtcacccgg cggccaaagg ccagtgacta ccaacgtttg gaccagaagt cctaacagaa 1380 tggtatattc ctctggaaaa agatgaacgt caccaatgga ttgtgctgct ctcgtttcag 1440 ctttgatttt tttgtccttg agaaccttgt cctccctgct gatttgtttc taaatcaaaa 1500 gaaatgaaga aaaaagtact gtgacctgag agacaccctc ctctagaatt tagtggcggg 1560 tctgggctgg cagaggtagg gggctgcttt gggctttgca cctgcacttt ggtgacattg 1620 ttcttctgtg ttccctttat ttatgctggt ggcttccatc cgttcctcct ctgagggtga 1680 gtggagaggt atatggaaac acggctgtga ccaaagggag atcccagcct gggcaggctg 1740 cgctgctgac caccctccct ggggcccggg ctctgtagga aagttggtcc ttgactgtgg 1800 cattgcactc tgcactgttt ctctctgcag acctagggga aaactgcagg tggaagtgct 1860 tttctactaa ggcctcttac tttggggggg atgtgcccta cagaagacat agaagatggg 1920 gaaatgccaa tgggcaaaga gctactttga atacataatt ctcttcaaag acttcagcag 1980 caaaccaaaa cagcaggtta aaaaaaaaga tgcttttttg ggtgcaagtc taacctgtct 2040 agcatgagat cttcttgatt ttctgattat tttatgtagc ttgagacaaa gtgaatcaac 2100 ttccacttag ttgtaccgag cataaaacag aacttgggct tcctggcagt gaggccactg 2160 tcccatcaca gatttttaaa ataaatatga tttgaagtag tgtgatcttt cacacaa 2217 336 6773 DNA Homo sapiens 336 gcggctggtt gcgggccggc ggcgggctgg cggagatgga ggatcttgtt caagatgggg 60 tggcttcacc agctacccct gggaccggga aatctaagaa ttggagaaag aaattgaaga 120 actcagatca aaacctgtta ctgaaggaac tggtgatatt attaaggcat taactgaacg 180 tctggatgct cttcttctgg aaaaagcaga gactgagcaa cagtgtcttt ctctgaaaaa 240 ggaaaatata aaaatgaagc aagaggttga ggattctgta acaaagatgg gagatgcaca 300 taaggagttg gaacaatcac atataaacta tgtgaaagaa attgaaaatt tgaaaaatga 360 gttgatggca gtacgttcca aatacagtga agacaaagct aacttacaaa agcagctgga 420 agaagcaatg aatacgcaat tagaactttc agaacaactt aaatttcaga acaactctga 480 agataatgtt aaaaaactac aagaagagat tgagaaaatt aggccaggct ttgaggagca 540 aattttatat ctgcaaaagc aattagacgc taccactgat gaaaagaagg aaacagttac 600 tcaactccaa aatatcattg aggctaattc tcagcattac caaaaaaata ttaatagttt 660 gcaggaagag cttttacagt tgaaagctat acaccaagaa gaggtgaaag agttgatgtg 720 ccagattgaa gcatcagcta aggaacatga agcagagata aataagttga acgagctaaa 780 agagaactta gtaaaacaat gtgaggcaag tgaaaagaac atccagaaga aatatgaatg 840 tgagttagaa aatttaagga aagccacctc aaatgcaaac caagacaatc agatatgttc 900 tattctcttg caagaaaata catttgtaga acaagtagta aatgaaaaag tcaaacactt 960 agaagatacc ttaaaagaac ttgaatctca acacagtatc ttaaaagatg aggtaactta 1020 tatgaataat cttaagttaa aacttgaaat ggatgctcaa catataaagg atgagttttt 1080 tcatgaacgg gaagacttag agtttaaaat taatgaatta ttactagcta aagaagaaca 1140 gggctgtgta attgaaaaat taaaatctga gctagcaggt ttaaataaac agttttgcta 1200 tactgtagaa cagcataaca gagaagtaca gagtcttaag gaacaacatc aaaaagaaat 1260 atcagaacta aatgagacat ttttgtcaga ttcagaaaaa gaaaaattaa cattaatgtt 1320 tgaaatacag ggtcttaagg aacagtgtga aaacctacag caagaaaagc aagaagcaat 1380 tttaaattat gagagtttac gagagattat ggaaatttta caaacagaac tgggggaatc 1440 tgctggaaaa ataagtcaag agttcgaatc aatgaagcaa cagcaagcat ctgatgttca 1500 tgaactgcag cagaagctca gaactgcttt tactgaaaaa gatgcccttc tcgaaactgt 1560 gaatcgcctc cagggagaaa atgaaaagtt actatctcaa caagaattgg taccagaact 1620 tgaaaatacc ataaagaacc ttcaagaaaa gaatggagta tacttactta gtctcagtca 1680 aagagatacc atgttaaaag aattagaagg aaagataaat tctcttactg aggaaaaaga 1740 tgattttata aataaactga aaaattccca tgaagaaatg gataatttcc ataagaaatg 1800 tgaaagggaa gaaagattga ttcttgaact tgggaagaaa gtagagcaaa caatccagta 1860 caacagtgaa ctagaacaaa aggtaaatga attaacagga ggactagagg agactttaaa 1920 agaaaaggat caaaatgacc aaaaactaga aaaacttatg gttcaaatga aagttctctc 1980 tgaagacaaa gaagtattgt cagctgaagt gaagtctctt tatgaggaaa acaataaact 2040 cagttcagaa aaaaaacagt tgagtaggga tttggaggtt tttttgtctc aaaaagaaga 2100 tgttatcctt aaagaacata ttactcaatt agaaaagaaa cttcagttaa tggttgaaga 2160 gcaagataat ttaaataaac tgcttgaaaa tgagcaagtt cagaagttat ttgttaaaac 2220 tcagttgtat ggttttctta aagaaatggg atcagaagtt tcagaagaca gtgaagagaa 2280 agatgttgtt aatgtcctac aggcagtcgg tgaatccttg gcaaaaataa atgaggaaaa 2340 atgcaacctg gcttttcagc gtgatgaaaa agtattagag ttagaaaaag agattaagtg 2400 ccttcaagaa gagagtgtag ttcagtgtga agaacttaag tctttattga gagactatga 2460 gcaagagaaa gttctcttaa ggaaagagtt agaagaaata cagtcagaaa aagaggccct 2520 gcagtctgat cttctagaaa tgaagaatgc taatgaaaaa acaaggcttg aaaatcagaa 2580 tcttttaatt caagttgaag aagtatctca aacatgtagc aaaagtgaaa tccataatga 2640 aaaagaaaaa tgttttataa aggaacatga aaacctaaag ccactactag aacaaaaaga 2700 attacgagat aggagagcag agttgatact attaaaggat tccttagcaa aatcaccttc 2760 tgtaaaaaat gatcctctgt cttcagtaaa agagttggaa gaaaaaatag aaaatctgga 2820 aaaagaatgc aaagaaaagg aggagaaaat aaataagata aaattagttg ccgtaaaggc 2880 aaagaaagaa ctagattcca gcagaaaaga gacccagact gtgaaggaag aacttgaatc 2940 tcttcgatca gaaaaggacc agttatctgc ttccatgaga gatctcattc aaggagcaga 3000 aagctataag aatcttttat tagaatatga aaagcagtca gagcaactgg atgtggaaaa 3060 agaacgtgct aataattttg agcatcgtat tgaagacctt acaagacaat taagaaattc 3120 gactttgcag tgtgaaacaa taaattctga taatgaagat ctcctggctc gtattgagac 3180 attacagtct aatgccaaat tattagaagt acagatttta gaagtccaga gagccaaagc 3240 aatggtagac aaagaattag aagctgaaaa acttcagaaa gaacagaaga taaaggaaca 3300 tgccactact gtaaatgaac ttgaagaact tcaggtacaa cttcaaaagg aaaagaaaca 3360 gcttcagaaa accatgcaag aattagagct ggttaaaaag gatgcccaac aaaccacatt 3420 gatgaatatg gaaatagctg attatgaacg tttgatgaaa gaactaaatc aaaagttaac 3480 taataaaaac aacaagatag aagatttgga gcaagaaata aaaattcaaa aacagaaaca 3540 agaaacccta caagaagaaa taacttcatt acagtcttca gtacaacaat atgaagaaaa 3600 aaacaccaaa atcaagcaat tgcttgtgaa aaccaaaaag gaactggcag attcaaagca 3660 agcagaaact gatcacttaa tacttcaagc atctttaaaa ggtgagctgg aggcaagcca 3720 gcagcaagta gaagtctata aaatacagct ggctgaaata acatcagaga agcacaaaat 3780 ccacgagcac ctgaaaacct ctgcggaaca gcaccagcgt acgctaagtg cataccagca 3840 gagagtgaca gcactacagg aagagtgccg tgctgccaag gcagaacaag ctactgtaac 3900 ctctgaattc gagagctaca aagtccgagt tcataatgtt ctaaaacaac agaaaaataa 3960 atctatgtct caggctgaaa ctgagggcgc taaacaagaa agggaacatc tggaaatgct 4020 gattgaccag ctaaaaatca aattacaaga tagccaaaat aacttacaga ttaatgtatc 4080 tgaacttcaa acattgcagt ctgaacatga tacactgcta gaaaggcaca acaagatgct 4140 gcaggaaact gtgtccaaag aggcggaact ccgggaaaaa ttgtgttcaa tacagtcaga 4200 gaacatgatg atgaaatctg aacatacaca gactgtgagt cagctaacat cccagaacga 4260 ggtccttcga aatagcttcc gagatcaagt gcgacatttg caggaagaac acagaaagac 4320 agtggagaca ttacagcagc agctctccaa gatggaagca cagctcttcc agcttaagaa 4380 tgaaccgacc acaagaagcc cagtttcctc tcaacaatct ttgaagaacc ttcgagaaag 4440 gagaaacaca gacctcccgc ttctagacat gcacactgta acccgggaag agggagaagg 4500 catggagaca actgatacgg agtctgtgtc ttccgccagc acatacacac agtctttaga 4560 gcagctgctt aactctcccg aaactaaact tgagcctcca ttatggcatg ctgaatttac 4620 caaagaagaa ttggttcaga agctcagttc caccacaaaa agtgcagatc acttaaacgg 4680 cctgcttcgg gaaacagaag caaccaatgc aattcttatg gagcaaatta agcttctcaa 4740 aagtgaaata agaagattgg aaaggaatca agagcgagag aagtctgcag ctaacctgga 4800 atacttgaag aacgtcttgc tgcagttcat tttcttgaaa ccaggtagtg aaagagagag 4860 acttcttcct gttataaata cgatgttgca gctcagccct gaagaaaagg gaaaacttgc 4920 tgcggttgct caaggtgagg aagaaaatgc ttcccgttct tctggatggg catcctatct 4980 tcatagttgg tctggacttc gataggttga tggaaggaat atttttatta accaaataga 5040 atctatttac aaaaatggtt cacgtatatt accacaattc ttttgtcaaa aagtgtgtat 5100 atatgtttgc atctacatat atttgtacat ctatatgaca gatgtatttt aaaagtttca 5160 tcttgaagta aaagtacaac agcttgaagt gttgatagca ggccacagcc ctctaactca 5220 tgtgatttcc catgcatgct gccagaataa aaccaccagg aatgaattca ctccccactt 5280 ctctggaacc tcaggacccg cccatttctc ggcagtactg tgaattttga agttaaacta 5340 aattttggta ccataccaac tggaatttag gctttaaaaa taatgtttca aggccaggtg 5400 tggtgattca tgcctgaaat cccactactt tgggaggctg aggctggaga attgcttgag 5460 gctagtgagc tgtgactccc actgcactcc agctcgggga acagagcgag accttgtctc 5520 taaaaataat agtaataaaa taaaaataac gttttatgac tatttattgc aaggtcagag 5580 ttacagattg ttataaattg ttgagaaatt tttgtgatta gaatatgaag gaaaaagctt 5640 tgttggtaaa agtgacatgt taaggggcta tgaagtaaat atgctgcagt taattgtgct 5700 aagttaaaat acagtttagt tatttgcttt aaaataaact cttctttttt tctttaaagt 5760 atactatctc aaaactcatt atgttgtcag agccctagag ctggctagtg taacactgac 5820 tatgagtagg tgggcccacc acttgagttg aggtgatttc atggtgtctt tccaggctct 5880 tgatagggtg tcactgcatg caagccatga atctgttttg agaatcctct ccattttccc 5940 aaataaaaac ctatcacaac agtgactata tcactcagca ttggatctaa atataaaagt 6000 ggtgctttca gtgtttttgg cagatagtgt tccataagct ttccatcaga agggatttta 6060 gacaccttag aggtccgtgc tacatcgtca cagttcctcc gaataacctt aggtggtagt 6120 gttacttgcc tttgacacct ctgcatatgt tttaatgact agatccaaac tgtgttgttc 6180 ttaaatcaaa aattggataa tttgtaatat ttatgtgtta atcacacagt atgctctctg 6240 aagttctctt aagccttcag tttatactct taatttaatt ttctttctga gctggagaac 6300 tggctttgca ctttggttac acagaacatt ggtttccaat tcagtttaac tgaaatttgc 6360 tgctgatatg ttgagtttgt tctttaaaaa atagctcata tatctcatct ttcctcctgt 6420 cttagaagaa cagacctaac tagtgaatgt attaatgaaa atgcatctat ttcagagctg 6480 acatgaagag tttagttttt ttactttata aactgtgaat atgagtatgc cagctgcata 6540 cgatgtaact aatcatattt aaatatattt cactttctct ttgactttag accttttgaa 6600 gtctgtataa acttgttttg aaatatagtc tctgcttacg aatgtcataa caaaataatt 6660 ttttgcatga taaaaaatta ctttgattac aaaaggcgta ttctttcatg gtttctgcaa 6720 tgagaggaag tgtaatgatt attttaatat ttctattaaa tatgtttaac tgt 6773 337 4372 DNA Homo sapiens 337 cacgactcca cacgcgcgca cgcagccagc gagcggccgg agcggacggc agacggggcg 60 ggcggcgtca gggtcgcagc gtctacagct gctcgggggc ggtttcttgg cggaggcttg 120 gccggctcct ctctcccggc tccgcggcgg ctgcgaaggc ggcggctcct gccctctcgc 180 tttccctctc gcgtctctgg ctgcaggtga aaggaaagca agccaggatg gatatttacg 240 acactcaaac cttgggggtt gtggtctttg gaggattcat ggttgtttct gccattggca 300 tcttcctggt gtcgactttc tccatgaagg aaacgtcata tgaagaagcc ctagccaacc 360 agcgcaagga gatggcgaaa actcaccacc agaaagtcga gaagaaaaag aaggagaaaa 420 cagtggagaa gaaaggaaag accaagaaaa aggaagagaa acctaatggg aagatacctg 480 atcatgatcc agcccccaat gtgactgtcc tccttcgaga accagtgcgg gctcctgctg 540 tggctgtggc tccaacccca gtgcagcccc ccattatcgt tgctcctgtc gccacagttc 600 cagccatgcc ccaggagaag ctggcctcct cccccaagga caaaaagaag aaggagaaaa 660 aagtggcaaa agtggaacca gctgtcagct ctgtagtgaa ttccatccag gttctcactt 720 cgaaggctgc catcttggaa actgctccca aggaggtgcc gatggtggtg gtgcccccag 780 tgggtgccaa gggcaacaca ccagccactg gcactactca gggcaaaaag gcggagggga 840 ctcagaatca aagcaaaaag gctgaaggag ccccaaacca gggcagaaag gcagagggaa 900 ccccaaacca gggcaaaaag acagagggaa ccccaaacca agggaaaaag gcagagggaa 960 ccccaaacca aggcaaaaag gcagaaggaa ccccaaacca aggcaaaaag gcggaggggg 1020 cccagaacca gggtaaaaag gtagatacaa ccccaaacca ggggaaaaag gtggaggggg 1080 ccccaaccca gggcagaaag gccgaggggg ctcagaacca ggccaaaaag gtagaagggg 1140 cccagaacca gggcaaaaag gcagaggggg cccagaatca gggcaaaaag ggagaggggg 1200 cccagaacca gggcaagaag gccgaggggg cccagaatca gggcaagaag gccgaggggg 1260 cccagaatca gggcaagaag gccgaggggg cccagaatca gggcaagaag gccgaggggg 1320 cccagaatca gggcaagaag gctgaggggg ctcagaacca gggcaaaaag gccgaggggg 1380 ctcagaacca gggcaaaaaa gtagaagggg cccagaacca gggcaagaag gctgagggtg 1440 cccagaacca gggcaaaaag gccgaggggg cccagaatca gggcaaaaag gccgaggggg 1500 cccagaacca gggcaagaag gcagaggggg cccagaacca gggcaagaag gccgaggggg 1560 cccagaacca ggacaagaag gccgaggggg cccagaacca gggcaggaag gccgaggggg 1620 cccagaacca gggcaggaag gccgaggggg cccagaacca gggcaagaag gccgaggggg 1680 cccagaacca gggcaagaag gccgaggggg ccccgaacca gggcaagaag gccgagggga 1740 ccccgaacca gggcaagaag gccgagggga ccccgaacca gggcaagaag gccgagggga 1800 ccccgaacca gggcaagaag gccgagggga ccccgaacca gggcaagaag gccgaggggg 1860 cccagaacca gggcaagaag gccgaggggg cccagaacca gggcaagaag gccgagggga 1920 ccccgaacca gggcaagaag gccgaggggg cccagaacca gggcaagaag gccgaggggg 1980 cccagaacca gggcaagaag gccgaggggg cccagaacca gggcaagaag gccgaggggg 2040 cccagaacca gggcaagaag gccgaggggg cccagaacca gggcaagaag gccgaggggg 2100 cccagaacca gggcaagaag gccgagggtg ctcagaacca gggcaaaaaa gtagaagggg 2160 cccagaacca gggcaagaag gctgaggggg cccagaacca gggcaagaag gccgaggggg 2220 ctcagaacca gggcaaaaag gccgagggag cccagaacca gggccaaaaa ggagagggag 2280 cccagaatca gggtaaaaag acagaagggg ctcagggcaa aaaggcagaa aggagtccca 2340 accaaggcaa aaaaggagag ggagctccca tccagggcaa aaaggcagat tcggttgcta 2400 atcagggcac aaaggtagag ggtattacaa accaggggaa aaaagcagaa gggtccccca 2460 gtgaaggcaa aaaggcagaa gggtccccca accaaggcaa aaaggcagac gcagctgcca 2520 atcagggtaa aaagacagag tcagcttctg tccagggcag aaatacagat gtggcccaga 2580 gcccagaggc accaaagcaa gaggctcctg ccaagaagaa gtctggttca aagaaaaaag 2640 gtgagcctgg gcccccagat gccgacggcc ctctctacct cccctacaag acgctggtct 2700 ccacggttgg gagcatggtg ttcaacgagg gcgaggccca gcggctcatc gagatcctgt 2760 ctgagaaggc tggcatcatt caggacacct ggcacaaggc cactcagaag ggtgaccctg 2820 tggcgattct gaaacgccag ctggaagaga aggaaaaact gctggccaca gaacaggaag 2880 atgcggctgt cgccaagagc aaactgaggg agctcaacaa ggagatggca gcagaaaagg 2940 ccaaagcagc agccggggag gccaaagtga aaaagcagct ggtggcccgg gagcaggaga 3000 tcacggctgt gcaggcacgc atgcaggcca gctaccggga gcacgtgaag gaggtgcagc 3060 agctgcaggg caagatccgg actcttcagg agcagctgga gaatggcccc aacacgcagc 3120 tggcccgcct gcagcaggag aactccatcc tgcgggatgc cttgaaccag gccacgagcc 3180 aggtggagag caagcagaac gcagagctgg ccaagcttcg gcaggagctc agcaaggtca 3240 gcaaagagct ggtggagaag tcagaggctg tgcggcaaga tgagcagcag cggaaagctc 3300 tggaagccaa ggcagctgcc ttcgagaagc aggtcctgca gctgcaggcg tcccacaggg 3360 agagtgagga ggccctgcag aagcgcctgg acgaggtcag ccgggagctg tgccacacgc 3420 agagcagcca cgccagcctc cgggcggatg ccgagaaggc ccaggagcaa cagcagcaga 3480 tggccgagct gcacagcaag ttacagtcct ccgaggcgga ggtgcgcagc aaatgcgagg 3540 agctgagtgg cctccacggg cagctccagg aggccagggc ggagaactcc cagctcacag 3600 agagaatccg ttccattgag gccctgctgg aggcgggcca ggcgcgggat gcccaggacg 3660 tccaggccag ccaggcggag gctgaccagc agcagactcg cctcaaggag ctggagtccc 3720 aggtgtcggg tctggagaag gaggccatcg agctcaggga ggccgtcgag cagcagaaag 3780 tgaagaacaa tgacctccgg gagaagaact ggaaggccat ggaggcactg gccacggccg 3840 agcaggcctg caaggagaag ctgcactccc tgacccaggc caaggaggaa tcggagaagc 3900 agctctgtct gattgaggcg cagaccatgg aggccctgct ggctctgctc ccagaactct 3960 ctgtcttggc acaacagaat tacaccgagt ggctgcagga tctcaaagag aaaggcccca 4020 cgctgctgaa gcacccgcca gctcccgcgg agccctcctc ggacctggcc tccaagttga 4080 gggaggccga ggagacgcag agcacactgc aggccgagtg tgaccagtac cgcagcatcc 4140 tggcggagac ggagggcatg ctcagagacc tgcagaagag cgtggaggag gaggagcagg 4200 tgtggagggc caaggtgggc gccgcagagg aggagctcca gaagtcccgg gtcacagtga 4260 agcatctcga agagattgta gagaagctaa aaggagaact tgaaagttcg gaccaggtga 4320 gggagcacac gttgcatttg gaggcagagc tggaaaagca catggcggcc gc 4372 338 3635 DNA Homo sapiens 338 tcaacacagg acaatgcaag cccatgagct gttccggtat tttcgaatgc cagagctggt 60 tgacttccga cagtacgtgc gtactcttcc gaccaacacg cttatgggct tcggagcttt 120 tgcagcactc accaccttct ggtacgccac gagacccaaa cccctgaagc cgccatgcga 180 cctctccatg cagtcagtgg aagtggcggg tagtggtggt gcacgaagat ccgcactact 240 tgacagcgac gagcccttgg tgtatttcta tgatgatgtc acaacattat acgaaggttt 300 ccagagggga atacaggtgt caaataatgg cccttgttta ggctctcgga aaccagacca 360 accctatgaa tggctttcat ataaacaggt tgcagaattg tcggagtgca taggctcagc 420 actgatccag aagggcttca agactgcccc agatcagttc attggcatct ttgctcaaaa 480 tagacctgag tgggtgatta ttgaacaagg atgctttgct tattcgatgg tgatcgttcc 540 actttatgat acccttggaa atgaagccat cacgtacata gtcaacaaag ctgaactctc 600 tctggttttt gttgacaagc cagagaaggc caaactctta ttagagggtg tagaaaataa 660 gttaatacca ggccttaaaa tcatagttgt catggatgcc tacggcagtg aactggtgga 720 acgaggccag aggtgtgggg tggaagtcac cagcatgaag gcgatggagg acctgggaag 780 agccaacaga cggaagccca agcctccagc acctgaagat cttgcagtaa tttgtttcac 840 aagtggaact acaggcaacc ccaaaggagc aatggtcact caccgaaaca tagtgagcga 900 ttgttcagct tttgtgaaag caacagagaa tacagtcaat ccttgcccag atgatacttt 960 gatatctttc ttgcctctcg cccatatgtt tgagagagtt gtagagtgtg taatgctgtg 1020 tcatggagct aaaatcggat ttttccaagg agatatcagg ctgctcatgg atgacctcaa 1080 ggtgcttcaa cccactgtct tccccgtggt tccaagactg ctgaaccgga tgtttgaccg 1140 aattttcgga caagcaaaca ccacgctgaa gcgatggctc ttggactttg cctccaagag 1200 gaaagaagca gagcttcgca gcggcatcat cagaaacaac agcctgtggg accggctgat 1260 cttccacaaa gtacagtcga gcctgggcgg aagagtccgg ctgatggtga caggagccgc 1320 cccggtgtct gccactgtgc tgacgttcct cagagcagcc ctgggctgtc agttttatga 1380 aggatacgga cagacagagt gcactgccgg gtgctgccta accatgcctg gagactggac 1440 cgcaggccat gttggggccc cgatgccgtg caatttgata aaacttgttg atgtggaaga 1500 aatgaattac atggctgccg agggcgaggg cgaggtgtgt gtgaaagggc caaatgtatt 1560 tcagggctac ttgaaggacc cagcgaaaac agcagaagct ttggacaaag acggctggtt 1620 acacacaggg gacattggaa aatggttacc aaatggcacc ttgaaaatta tcgaccggaa 1680 aaagcacata tttaagctgg cacaaggaga atacatagcc cctgaaaaga ttgaaaatat 1740 ctacatgcga agtgagcctg ttgctcaggt gtttgtccac ggagaaagcc tgcaggcatt 1800 tctcattgca attgtggtac cagatgttga gacattatgt tcctgggccc aaaagagagg 1860 atttgaaggg tcgtttgagg aactgtgcag aaataaggat gtcaaaaaag ctatcctcga 1920 agatatggtg agacttggga aggattctgg tctgaaacca tttgaacagg tcaaaggcat 1980 cacattgcac cctgaattat tttctatcga caatggcctt ctgactccaa caatgaaggc 2040 gaaaaggcca gagctgcgga actatttcag gtcgcagata gatgacctct attccactat 2100 caaggtttag tgtgaagaag aaagctcaga ggaaatggca cagttccaca atctcttctc 2160 ctgctgatgg ccttcatgtt gttaattttg aatacagcaa gtgtagggaa ggaagcgttc 2220 gtgtttgact tgtccattcg gggttcttct cataggaatg ctagaggaaa cagaacaccg 2280 ccttacagtc acctcatgtt gcagaccatg tttatggtaa tacacacttt ccaaaatgag 2340 ccttaaaaat tgtaaagggg atactataaa tgtgctaagt tatttgagac ttcctcagtt 2400 taaaaagtgg gttttaaatc ttctgtctcc ctgcttttct aatcaagggg ttaggacttt 2460 gctatctctg agatgtctgc tacttgctgc aaattctgca gctgtctgct gctctaaaga 2520 gtacagtgca ctagagggaa gtgttccctt taaaaataag aacaactgtc ctggctggag 2580 aatctcacaa gcggaccaga gatcttttta aatccctgct actgtccctt ctcacaggca 2640 ttcacagaac ccttctgatt cgtaagggtt acgaaactca tgttcttctc cagtcccctg 2700 tggtttctgt tggagcataa ggtttccagt aagcgggagg gcagatccaa ctcagaacca 2760 tgcagataag gagcctctgg caaatgggtg ctcatcagaa cgcgtggatt ctctttcatg 2820 gcagaatgct cttggactcg gttctccagg cctgattccc cgactccatc ctttttcagg 2880 ggttatttaa aaatctgcct tagattctat agtgaagaca agcatttcaa gaaagagtta 2940 cctggatcag ccatgctcag ctgtgacgcc tgaataactg tctactttat cttcactgaa 3000 ccactcactc tgtgtaaagg ccaacagatt tttaatgtgg ttttcatatc aaaagatcat 3060 gttgggatta acttgccttt ttccccaaaa aataaactct caggcaagca tttctttaaa 3120 gctattaagg gagtatatac ttgagtactt attgaaatgg acagtaataa gcaaatgttc 3180 ttataatgct acctgatttc tatgaaatgt gtttgacaag ccaaaattct aggatgtaga 3240 aatctggaaa gttcatttcc tgggattcac ttctccaggg attttttaaa gttaatttgg 3300 gaaattaaca gcagttcact ttattgtgag tctttgccac atttgactga attgagctgt 3360 catttgtaca tttaaagcag ctgttttggg gtctgtgaga gtacatgtat tatatacaag 3420 cacaacaggg cttgcactaa agaattgtca ttgtaataac actacttggt agcctaactt 3480 catatatgta ttcttaattg cacaaaaagt caataatttg tcaccttggg gttttgaatg 3540 tttgctttaa gtgttggcta tttctatgtt ttataaacca aaacaaaatt tccaaaaaca 3600 atgaaggaaa ccaaaataaa tatttctgca tttcg 3635 339 2660 DNA Homo sapiens 339 ggcctcgagc gccccggcgg gaggtttttc tatatgagtg gagaagacag ctgttaccag 60 ggaggtcata caacattttt ttaggatgtc tgaagatgaa gaaaaagtga aattacgccg 120 tcttgaacca gctatccaga aattcattaa gatagtaatc ccaacaaacc tggaaaggtt 180 aagaaagcac cagataaata ttgagaagta tcaaaggtgc agaatctggg acaagttgca 240 tgaagagcat atcaatgcag gacgtacagt tcagcaactc cgatccaata tccgagaaat 300 tgagaaactt tgtttgaaag tccgaaagga tgacctagta cttctgaaga gaatgataga 360 tcctgttaaa gaagaagcat cagcagcaac agcagaattt ctccaactcc atttggaatc 420 tgtagaagaa cttaagaagc aatttaatga tgaagaaact ttgctacagc ctcctttgac 480 cagatccatg actgttggtg gagcatttca tactactgaa gctgaagcta gttctcagag 540 tttgactcag atatatgcct tacctgaaat tcctcaagat caaaatgctg cagaatcgcg 600 ggaaacctta gaagcggact taattgaact tagccaactg gtcactgact tctctctcct 660 agtgaactct cagcaggaga agattgacag cattgcagac catgtcaaca gtgctgctgt 720 gaatgttgaa gagggaacca aaaacttagg gaaggctgca aaatacaagc tggcagctct 780 gcctgtggca ggtgcactca tcgggggaat ggtagggggt cctattggcc tccttgcatg 840 cttcaaagtg gcaggaattg cagctgcact tggtggtggg gtgttgggct tcacaggtgg 900 aaaattgata caaagaaaga aacagaaaat gatggagaag ctcacttcca gctgtccaga 960 tcttcccagc caaactgaca agaaatgcag ttaaaaacca aatttcagta ttattggtgc 1020 caacatgtct atcctgagga cctttgctgc tgttggacac tccgtcacct tttggaacac 1080 aagtatatca agatagtggc tactgatgtt caagtgggat tgaagtgtga taaatggata 1140 tattttgttg tttgctgggg tgttcatgga gatgttaaga gattgaggcc ctgggctgag 1200 ggtatataat gtatgtcagg taaagtttga agactgccaa ggagcagatt ttctccctgg 1260 aaatgtgaaa actgaaccta taactctgat aaggacttga gatgtgtaga aacgttgggt 1320 tatggaagac tagtttcttc cataaccctg aattggagac cttaatgcta agtgtagatt 1380 attgaggttt gttagtgagg aaaagaataa gagttcagaa gcctttgtta tcagatagcg 1440 aaatcagggc ctagtgagga gcacaggtcg actacataat ggagtccatt ggcgaaccct 1500 attgcaattt ggtccaacta tatcttctgg tgaaggaaat taatgatgta agaaaatgca 1560 agaggctcaa cttctcttcc aaaaatcttc tggcttctga actcttcctc tgcctctctt 1620 taaataaata acacagaatt tcaagtggta ggagacttat taagccagtc accaagcttg 1680 gtctgtcagc ctgtcttcta acacctcaaa gatcttgtgc cctgtgctgt ccctcccttg 1740 taattatgaa aagttctttg gtttctgggg tgaactctac ccatgtataa tgaggaattc 1800 tctcataacc ttttttgtct tgtctgtcat ctctgttcat cccctcctat aacctctagg 1860 taaaaagaaa agaaaaaaag aaatttcgag atattttcaa cattgttaga gtttgggcta 1920 aaatgagcaa ggagaaaaaa accaccaaga acatttcctg gggcatgttc cagttttgag 1980 gggtgatata tctgccagat agggggtatc tgacccagtc ttcttttcag ctggtctctg 2040 gggggagctg agaactcgct tgctacctca catccttttc cccagacttt ttatctccta 2100 tgcatccctt tgctttctat agctggtgtt tcttccccaa aatggcgttc ccatgcttac 2160 ctttctcaca ttctagacaa tgatggacaa agacgcatgc aagactcaga cccggggaat 2220 ggtgtggtgc taatctcaac acctgacatt cacagcaagc atggcccagc ccaaccgcat 2280 gtctatctca aaccgcagaa aggctttaat actggaaaaa aagaattcaa gactacaggc 2340 agctcccctc tgtaccccaa ctcatttaaa ataggaggaa tcactttttg ccttacttaa 2400 cgcttttttc tgagcacagg gatgggcacc tgcaccccag aaggtgtgag ctgtctctct 2460 gccaggagct aaggttcatt aggggattgg atggtttatc acttctttct ttctgagttt 2520 acttttagta acttttattg atggctacct ttcatgtccc tgtctaaaga gactttctct 2580 ttcatacgtc ttaaatctca tcaatgaaat ccagtgaaac agcaccattt cttagtatca 2640 ttaaataact agaaagtatc 2660 340 698 PRT Homo sapiens 340 Met Gln Ala His Glu Leu Phe Arg Tyr Phe Arg Met Pro Glu Leu Val 5 10 15 Asp Phe Arg Gln Tyr Val Arg Thr Leu Pro Thr Asn Thr Leu Met Gly 20 25 30 Phe Gly Ala Phe Ala Ala Leu Thr Thr Phe Trp Tyr Ala Thr Arg Pro 35 40 45 Lys Pro Leu Lys Pro Pro Cys Asp Leu Ser Met Gln Ser Val Glu Val 50 55 60 Ala Gly Ser Gly Gly Ala Arg Arg Ser Ala Leu Leu Asp Ser Asp Glu 65 70 75 80 Pro Leu Val Tyr Phe Tyr Asp Asp Val Thr Thr Leu Tyr Glu Gly Phe 85 90 95 Gln Arg Gly Ile Gln Val Ser Asn Asn Gly Pro Cys Leu Gly Ser Arg 100 105 110 Lys Pro Asp Gln Pro Tyr Glu Trp Leu Ser Tyr Lys Gln Val Ala Glu 115 120 125 Leu Ser Glu Cys Ile Gly Ser Ala Leu Ile Gln Lys Gly Phe Lys Thr 130 135 140 Ala Pro Asp Gln Phe Ile Gly Ile Phe Ala Gln Asn Arg Pro Glu Trp 145 150 155 160 Val Ile Ile Glu Gln Gly Cys Phe Ala Tyr Ser Met Val Ile Val Pro 165 170 175 Leu Tyr Asp Thr Leu Gly Asn Glu Ala Ile Thr Tyr Ile Val Asn Lys 180 185 190 Ala Glu Leu Ser Leu Val Phe Val Asp Lys Pro Glu Lys Ala Lys Leu 195 200 205 Leu Leu Glu Gly Val Glu Asn Lys Leu Ile Pro Gly Leu Lys Ile Ile 210 215 220 Val Val Met Asp Ala Tyr Gly Ser Glu Leu Val Glu Arg Gly Gln Arg 225 230 235 240 Cys Gly Val Glu Val Thr Ser Met Lys Ala Met Glu Asp Leu Gly Arg 245 250 255 Ala Asn Arg Arg Lys Pro Lys Pro Pro Ala Pro Glu Asp Leu Ala Val 260 265 270 Ile Cys Phe Thr Ser Gly Thr Thr Gly Asn Pro Lys Gly Ala Met Val 275 280 285 Thr His Arg Asn Ile Val Ser Asp Cys Ser Ala Phe Val Lys Ala Thr 290 295 300 Glu Asn Thr Val Asn Pro Cys Pro Asp Asp Thr Leu Ile Ser Phe Leu 305 310 315 320 Pro Leu Ala His Met Phe Glu Arg Val Val Glu Cys Val Met Leu Cys 325 330 335 His Gly Ala Lys Ile Gly Phe Phe Gln Gly Asp Ile Arg Leu Leu Met 340 345 350 Asp Asp Leu Lys Val Leu Gln Pro Thr Val Phe Pro Val Val Pro Arg 355 360 365 Leu Leu Asn Arg Met Phe Asp Arg Ile Phe Gly Gln Ala Asn Thr Thr 370 375 380 Leu Lys Arg Trp Leu Leu Asp Phe Ala Ser Lys Arg Lys Glu Ala Glu 385 390 395 400 Leu Arg Ser Gly Ile Ile Arg Asn Asn Ser Leu Trp Asp Arg Leu Ile 405 410 415 Phe His Lys Val Gln Ser Ser Leu Gly Gly Arg Val Arg Leu Met Val 420 425 430 Thr Gly Ala Ala Pro Val Ser Ala Thr Val Leu Thr Phe Leu Arg Ala 435 440 445 Ala Leu Gly Cys Gln Phe Tyr Glu Gly Tyr Gly Gln Thr Glu Cys Thr 450 455 460 Ala Gly Cys Cys Leu Thr Met Pro Gly Asp Trp Thr Ala Gly His Val 465 470 475 480 Gly Ala Pro Met Pro Cys Asn Leu Ile Lys Leu Val Asp Val Glu Glu 485 490 495 Met Asn Tyr Met Ala Ala Glu Gly Glu Gly Glu Val Cys Val Lys Gly 500 505 510 Pro Asn Val Phe Gln Gly Tyr Leu Lys Asp Pro Ala Lys Thr Ala Glu 515 520 525 Ala Leu Asp Lys Asp Gly Trp Leu His Thr Gly Asp Ile Gly Lys Trp 530 535 540 Leu Pro Asn Gly Thr Leu Lys Ile Ile Asp Arg Lys Lys His Ile Phe 545 550 555 560 Lys Leu Ala Gln Gly Glu Tyr Ile Ala Pro Glu Lys Ile Glu Asn Ile 565 570 575 Tyr Met Arg Ser Glu Pro Val Ala Gln Val Phe Val His Gly Glu Ser 580 585 590 Leu Gln Ala Phe Leu Ile Ala Ile Val Val Pro Asp Val Glu Thr Leu 595 600 605 Cys Ser Trp Ala Gln Lys Arg Gly Phe Glu Gly Ser Phe Glu Glu Leu 610 615 620 Cys Arg Asn Lys Asp Val Lys Lys Ala Ile Leu Glu Asp Met Val Arg 625 630 635 640 Leu Gly Lys Asp Ser Gly Leu Lys Pro Phe Glu Gln Val Lys Gly Ile 645 650 655 Thr Leu His Pro Glu Leu Phe Ser Ile Asp Asn Gly Leu Leu Thr Pro 660 665 670 Thr Met Lys Ala Lys Arg Pro Glu Leu Arg Asn Tyr Phe Arg Ser Gln 675 680 685 Ile Asp Asp Leu Tyr Ser Thr Ile Lys Val 690 695 341 302 PRT Homo sapiens 341 Met Ser Glu Asp Glu Glu Lys Val Lys Leu Arg Arg Leu Glu Pro Ala 5 10 15 Ile Gln Lys Phe Ile Lys Ile Val Ile Pro Thr Asp Leu Glu Arg Leu 20 25 30 Arg Lys His Gln Ile Asn Ile Glu Lys Tyr Gln Arg Cys Arg Ile Trp 35 40 45 Asp Lys Leu His Glu Glu His Ile Asn Ala Gly Arg Thr Val Gln Gln 50 55 60 Leu Arg Ser Asn Ile Arg Glu Ile Glu Lys Leu Cys Leu Lys Val Arg 65 70 75 80 Lys Asp Asp Leu Val Leu Leu Lys Arg Met Ile Asp Pro Val Lys Glu 85 90 95 Glu Ala Ser Ala Ala Thr Ala Glu Phe Leu Gln Leu His Leu Glu Ser 100 105 110 Val Glu Glu Leu Lys Lys Gln Phe Asn Asp Glu Glu Thr Leu Leu Gln 115 120 125 Pro Pro Leu Thr Arg Ser Met Thr Val Gly Gly Ala Phe His Thr Thr 130 135 140 Glu Ala Glu Ala Ser Ser Gln Ser Leu Thr Gln Ile Tyr Ala Leu Pro 145 150 155 160 Glu Ile Pro Gln Asp Gln Asn Ala Ala Glu Ser Trp Glu Thr Leu Glu 165 170 175 Ala Asp Leu Ile Glu Leu Ser Gln Leu Val Thr Asp Phe Ser Leu Leu 180 185 190 Val Lys Ser Gln Gln Glu Lys Ile Asp Ser Ile Ala Asp His Val Asn 195 200 205 Ser Ala Ala Val Asn Val Glu Glu Gly Thr Lys Asn Leu Gly Lys Ala 210 215 220 Ala Lys Tyr Lys Leu Ala Ala Leu Pro Val Ala Gly Ala Leu Ile Gly 225 230 235 240 Gly Met Val Gly Gly Pro Ile Gly Leu Leu Ala Gly Phe Lys Val Ala 245 250 255 Gly Ile Ala Ala Ala Leu Gly Gly Gly Val Leu Gly Phe Thr Gly Gly 260 265 270 Lys Leu Ile Gln Arg Lys Lys Gln Lys Met Met Glu Lys Leu Thr Ser 275 280 285 Ser Cys Pro Asp Leu Pro Ser Gln Thr Asp Lys Lys Cys Ser 290 295 300 

What is claimed:
 1. An isolated polynucleotide comprising a sequence selected from the group consisting of: (a) sequences provided in SEQ ID NOs:1-168 and 173-339; (b) complements of the sequences provided in SEQ ID NOs:1-168 and 173-339; (c) sequences consisting of at least 20 contiguous residues of a sequence provided in SEQ ID NOs:1-168 and 173-339; (d) sequences that hybridize to a sequence provided in SEQ ID NOs:1-168 and 173-339, under highly stringent conditions; (e) sequences having at least 75% identity to a sequence of SEQ ID NOs: 1-168 and 173-339; (f) sequences having at least 90% identity to a sequence of SEQ ID NOs:1-168 and 173-339; and (g) degenerate variants of a sequence provided in SEQ ID NOs:1-168 and 173-339.
 2. An isolated polypeptide comprising an amino acid sequence selected from the group consisting of: (a) sequences encoded by a polynucleotide of claim 1; (b) sequences having at least 70% identity to a sequence encoded by a polynucleotide of claim 1; and (c) sequences having at least 90% identity to a sequence encoded by a polynucleotide of claim
 1. 3. An expression vector comprising a polynucleotide of claim 1 operably linked to an expression control sequence.
 4. A host cell transformed or transfected with an expression vector according to claim
 3. 5. An isolated antibody, or antigen-binding fragment thereof, that specifically binds to a polypeptide of claim
 2. 6. A method for detecting the presence of a cancer in a patient, comprising the steps of: (a) obtaining a biological sample from the patient; (b) contacting the biological sample with a binding agent that binds to a polypeptide of claim 2; (C) detecting in the sample an amount of polypeptide that binds to the binding agent; and (d) comparing the amount of polypeptide to a predetermined cut-off value and therefrom determining the presence of a cancer in the patient.
 7. A fusion protein comprising at least one polypeptide according to claim
 2. 8. An oligonucleotide that hybridizes to a sequence recited in SEQ ID NOs:1-168 and 173-339 under highly stringent conditions.
 9. A method for stimulating and/or expanding T cells specific for a tumor protein, comprising contacting T cells with at least one component selected from the group consisting of: (a) polypeptides according to claim 2; (b) polynucleotides according to claim 1; and (c) antigen-presenting cells that express a polynucleotide according to claim 1, under conditions and for a time sufficient to permit the stimulation and/or expansion of T cells.
 10. An isolated T cell population, comprising T cells prepared according to the method of claim
 9. 11. A composition comprising a first component selected from the group consisting of physiologically acceptable carriers and immunostimulants, and a second component selected from the group consisting of: (a) polypeptides according to claim 2; (b) polynucleotides according to claim 1; (c) antibodies according to claim 5; (d) fusion proteins according to claim 7; (e) T cell populations according to claim 10; and (f) antigen presenting cells that express a polypeptide according to claim
 2. 12. A method for stimulating an immune response in a patient, comprising administering to the patient a composition of claim
 11. 13. A method for the treatment of ovarian cancer in a patient, comprising administering to the patient a composition of claim
 11. 14. A method for determining the presence of a cancer in a patient, comprising the steps of: (a) obtaining a biological sample from the patient; (b) contacting the biological sample with an oligonucleotide according to claim 8; (c) detecting in the sample an amount of a polynucleotide that hybridizes to the oligonucleotide; and (d) comparing the amount of polynucleotide that hybridizes to the oligonucleotide to a predetermined cut-off value, and therefrom determining the presence of the cancer in the patient.
 15. A diagnostic kit comprising at least one oligonucleotide according to claim
 8. 16. A diagnostic kit comprising at least one antibody according to claim 5 and a detection reagent, wherein the detection reagent comprises a reporter group.
 17. A method for the treatment of ovarian cancer in a patient, comprising the steps of: (a) incubating CD4+ and/or CD8+ T cells isolated from a patient with at least one component selected from the group consisting of: (i) polypeptides according to claim 2; (ii) polynucleotides according to claim 1; and (iii) antigen presenting cells that express a polypeptide of claim 2, such that T cells proliferate; (b) administering to the patient an effective amount of the proliferated T cells, and thereby inhibiting the development of a cancer in the patient. 